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Abstract. No. In a number of papers Green and Wald argue that the standard 
FLRW model approximates our Universe extremely well on all scales, except close 
to strong field astrophysical objects. In particular, they argue that the effect of 
inhomogeneities on average properties of the Universe (backreaction) is irrelevant. We 
show that this latter claim is not valid. Specifically, we demonstrate, referring to 
their recent review paper, that (i) their two-dimensional example used to illustrate 
the fitting problem differs from the actual problem in important respects, and it 
assumes what is to be proven; (ii) the proof of the trace-free property of backreaction 
is unphysical and the theorem about it fails to be a mathematically general statement; 
(iii) the scheme that underlies the trace-free theorem does not involve averaging and 
therefore does not capture crucial non-local effects; (iv) their arguments are to a 
large extent coordinate-dependent, and (v) many of their criticisms of backreaction 
frameworks do not apply to the published definitions of these frameworks. It is 
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therefore incorrect to infer that Green and Wald have proven a general result that 
addresses the essential physical questions of backreaction in cosmology. 

PACS numbers: 04.20.-q, 04.20.-Cv, 95.30.-k, 95.36.+x, 98.80.-Jk 


1. General context 

Backreaction, i.e., the effect of inhomogeneities in matter and geometry on average 
cosmic evolution, has been studied in the cosmology community from various 
perspectives (see the reviews [1-8] and references therein). A series of influential papers 
by Green and Wald [9-12] have led many to believe that these effects are irrelevant or 
highly constrained. Those papers are an interesting contribution to the research field 
of backreaction effects in relativistic cosmology, but in this paper we demonstrate that 
the strong claims advanced by Green and Wald about the irrelevance of backreaction 
are unproven. 

In an earlier paper, Ishibashi and Wald [13] claimed that backreaction effects 
are negligibly small on all scales, except “in the immediate vicinity” of strong 
held astrophysical objects. This statement was based on the smallness of metric 
perturbations, despite the presence of large density perturbations [14, 15]. This 
conclusion evolved in subsequent papers [9-12], where it was stated that backreaction 
effects can in principle be large (due to the fact that derivatives of metric perturbations 
are large [16]), but they can in fact only contribute a radiation-like trace-free term to 
the effective stress-energy tensor. In this paper we will give evidence that backreaction 
effects can be significant and show that they need not be trace-free. 

We organize this paper in line with the presentation of arguments in the recent 
Green and Wald overview [12], and refer by the abbreviation GW to the statements 
made therein. In section 2 we consider the heuristic example of modelling a polyhedron 
with a sphere discussed by Green and Wald in [12]. We argue that this two-dimensional 
example misses essential features of backreaction. In section 3 we discuss the claim by 
Green and Wald that the contribution of backreaction to the effective stress-energy 
tensor is trace-free, and note that a non-trace-free part of backreaction is well- 
established; here we also comment on problems with the weak-limit procedure and 
its problematic relation to spatial averaging. In section 4 we discuss some relevant 
examples. In section 5 we comment on some misinterpretations related to scalar 
averaging, and in section 6 we put the discussion by Green and Wald of the relation 
to observations into perspective. We conclude in section 7. Appendices are devoted to 
technical aspects. 

2. An example of a fitting problem 

We start with the heuristic example in GW [12]. They state that this solves a fitting 
problem. The fitting problem is raised and explained in Refs. [17-19] and expressed 
in [20] as “How do we determine what is the best FLRW background model for the real 
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lumpy universe?”. In GW’s section 4.3 they argue for a parameter fitting approach in 
which it is a priori assumed that the best cosmological model is an FLRW model with 
no backreaction. By making a similar assumption in their introduction they conclude 
“Thus, the ‘htting problem’ is trivially solved”. From our perspective, it has not been 
satisfactorily answered, in that GW have a priori excluded the possibility that the best 
FLRW model might be one with backreaction corrections. (The real Universe might 
even be best modelled by some other geometry altogether). 

2.1. Sphereland or Flatland? 

The GW example is of the physical spherical surface of a “ball bearing” with all its “tiny 
defects”, and an a priori assumed £t by the standard round geometry of the two-sphere. 
As a model of the actual surface they take a polyhedral sphere (S^, g^oi) and consider 
how it is approximated by the (model) metric of a standard round sphere (S^, gcan) 
(“can” for “canonical”). The analogy they wish to draw is of an approximately FLRW 
metric such that the difference of the real and model metrics obeys the assumptions 
in their section 2. We discuss these assumptions and their consequences at length in 
subsection 3.4 and in Appendix B. 

Green and Wald assume (S^, gpoi) to be a convex polyhedron. However, convexity 
is not an intrinsic characteristic of (§^, gpoi). It comes about only when we consider 
(S^, gpoi) embedded into Euclidean space. The assumption of convexity involves extrinsic 
curvature, whereas the htting problem is concerned with intrinsic curvature (as it must 
be in the cosmological setting). 

Green and Wald discuss intrinsic aspects, by referring to E. Abbott’s Flatland 
[21]. They argue that when two-dimensional observers in ‘Sphereland’ “use triangles 
that are large compared with the distance between vertices, they obtain results that 
are reasonably consistent with spherical geometry. Nevertheless, even when looking 
at phenomena on large scales, the observers find some disturbing anomalies when 
attempting to model Sphereland by a perfect sphere” and conclude that “As a result of 
these observations, the observers might be tempted to conclude that the perfect sphere 
model provides a reasonably good description of Sphereland on large scales—although 
with some signihcant deviations—but an extremely poor description of Sphereland 
on small scales. However, the actual situation, of course, is that the metric, gab, of 
Sphereland is everywhere extremely close to the metric, g^^f^, of a perfect sphere.” 

The analogy is that cosmological observers may be similarly misled into thinking 
an FLRW model (without backreaction) was a poor description. This was made very 
clear in the recent note [22] responding to the hrst draft of the present paper. 

As quoted above. Green and Wald consider the Spherelanders performing geodesy 
with larger and larger triangles, thus exploring the (global aspects of the) intrinsic 
geometry of (§^, gpoi). By this means Spherelanders might, as GW suggest, reach the 
apparently obvious conclusion that gpoi is indeed approximated by the round metric 
gcan, with some small-scale anomalies. However, the measurements of the geometry 
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of triangles proposed by GW do not measure the metric, they measure curvature 
(integrated over the interiors of the triangles). Knowing the curvature and a hnite 
number of its derivatives does locally specify a metric uniquely, but at the present time 
we do not know a way to use this information to determine a best-£t approximate 
metric: we discuss these points in subsection 5.4. 

More relevantly, it is the “signihcant deviations” on small scales that are the 
important feature. Let us consider the steel ball example in more detail. The metric of 
each two-cell in the polyhedron (§^, g^oi) is not curved, but flat. Green and Wald recall 
that the ball’s (Gaussian) curvature is represented by a Dirac measure with support at 
the vertices of (§^, Ppoi), and geometrically represented by vertex deficit angles measuring 
excess or defect with respect to the 2tt vertex angles of a flat geometry. The latter 
implies, however, that the geometry of (§^, g^oi) is almost-everywhere flat with curvature 
defects supported at the vertices: the relevant background metric of (§^, g^oi) is thus 
not a spherical metric, as intuition and Figure 2 in GW’s paper might suggest, but a 
flat metric (with conical singularities representing curvature). 

To illustrate what knowing the curvature, as the Spherelanders might, means in 
the present setting, note that we may have positive curvatures as well as negative 
curvatures “sprinkled” over the vertices of (§^, gpoi)- The crucial point is that the 
“sprinkling” cannot be random: the amount of overall positive and overall negative 
curvature over the vertices must comply with the Gauss-Bonnet theorem applied to 
(§^, gpoi) [23], [24], [25]; the integrated Gaussian curvature must sum up to dvr, the 
Euler number x(§^) = 2. Thus, any two-surface which is topologically a sphere will 
have the same curvature, on average, as a round sphere. Only by measuring over the 
whole surface would Spherelanders End the correct total curvature, and anyway they 
would arrive at the same result (assuming the same topology) regardless of whether or 
not the coordinate components^ of the metric were close, either in pointwise value, or 
in some other suitable norm, to those of the round sphere. 

2.2. The consequences for the fitting and backreaction problems 

To emphasize the point, we may easily construct a large (let it be convex) polyhedral 
surface (S^, g-poi)-i (using a dual metrical triangulation), where thousands of vertices are 
flat and just a few carry some (tiny) dehcit angles. For the Sphereland inhabitants 
it would be very hard—perhaps impossible—to conclude that they actually live on a 
two-sphere. By probing rather large portions of their ambient space (analogous, say, to 
the observable Universe considered as part of an even larger spacetime) they would be 
more likely to conclude, potentially with an arbitrarily high degree of precision, that the 
geometry they live in is flat (there is no way they could intrinsically detect the edge¬ 
bending between the flat regions). Thus, guided by common sense and by such empirical 
evidence, they would probably consider as irrelevant to their cosmological modelling the 
local defects associated with a few dehcit angles in an ocean of hatness. However, it is 

^ We pick up the issue of the use of coordinate components in comparisons in subsection 5.4. 
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these local defects that determine the round sphere best-£t background, because it is the 
Euler number that determines the global properties of a two-dimensional surface. This 
is the only datum which would allow Spherelanders to draw GW’s desired conclusions 
on the best-fitting metric in this analogy, and it is thus the curvature defects that 
constitute the only relevant property. 

We remark that in the mathematical literature this issue has been treated in 
much greater depth than GW provide, giving more detailed presentations of polyhedral 
manifolds and of the resulting approximation techniques, as indicated in Appendix A. 

The steel ball analogy cannot say much about backreaction effects, especially on 
their evolution, since the topological constraint makes any two-dimensional example 
trivial. Because of the Gauss-Bonnet theorem, there is a conservation law for 
the integrated intrinsic curvature in two dimensions [8, 26], just as there is such a 
conservation law in the standard FLRW cosmological model in 3 + 1 dimensions. As 
a consequence, if one imagined the two-dimensional polyhedral model as representing 
the spatial surfaces in a three-dimensional spacetime, the curvature evolution would be 
decoupled from backreaction. 

Generally, backreaction effects may become strong when inhomogeneities 
dynamically couple to the averaged intrinsic curvature (which does not, in general, admit 
a conservation law in three dimensions [27]). This averaged curvature-backreaction 
coupling only arises in more than two dimensions (for details on the coupling of 
inhomogeneities to curvature and conservation laws, see [1]). 

2.3. Conclusion 

We have argued that the htting problem cannot be solved by declaring, dictated by 
common sense, that the best-fitting metric is Qcan with localized defects. Gonsidering 
curvature fluctuations as being unconstrained and assuming that they average out to 
zero is tautological since this assumes that curvature defects globally contribute nothing 
and, thus, assumes that the geometry (§^, Pcan) is the best-htting geometry without 
actually characterizing the explicit map between (§^, ppoi) and (S^, Pcan) (for discussion 
of such a map see Appendix A). Moreover, the geometry the Spherelanders experience 
in their surroundings (the physical geometry) is the almost-everywhere flat not the 

everywhere round gcan- In particular, the measured physical curvature defects are with 
respect to (§^, Ppoi) and not with respect to (§^, gcan)- 

From this brief analysis we see why Green and Wald obtain a highly constrained 
result for the possible backreaction in the steel ball model. 

3. Is backreaction necessarily trace—free ? 

No. There are many papers using standard perturbation theory and exact solutions 
of the Einstein equations that conclude that the backreaction has a trace part (see the 
reviews cited in the introduction and the references therein). But Green and Wald claim 
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that within a “completely general framework” [12] the hackreaction is trace-free. 

In this section we discuss the trace component in hackreaction and show that GW’s 
theorems are unrelated to actual hackreaction. Before doing so we look at the situation 
in the Newtonian limit. 

3.1. Traces of hackreaction in Newtonian cosmology 

The term hackreaction, as it is usually understood in Newtonian or relativistic cosmology, 
may he generally dehned as deviation of spatial average properties of an inhomogeneous 
universe model from the values predicted hy a homogeneous-isotropic universe model. 

In Newtonian cosmology the average expansion rate is affected hy the non-local term [28] 




( 1 ) 


where 6, a and u are the rate of expansion, shear and vorticity for a given velocity 
model, respectively, and the brackets denote volume averaging on a spatial domain T>, 
an averaging that is well-dehned even for tensors in Newtonian theory. The magnitude 
of this term has been estimated in [29] (and follow-up papers) using well-known models 
for large-scale structure that are in good agreement with iV— body simulations down to 
scales where the dust approximation breaks down. 

Interpreted in the context of a FLRW htting model, hackreaction leads to an 
effective stress-energy tensor that has the form of a perfect fluid with the components 
(see, e.g. [30], Subsection 3.2): 



where aj) is the volume scale factor, dehned such that the volume of the averaging 


3 

•V 


[31]. The local stress-energy tensor satishes the weak 


domain is proportional to a: 


energy condition, since p > 0 (implying p + p > 0 as we only consider dust, with p = 0). 
However, the effective stress-energy tensor does not necessarily satisfy these conditions, 
and neither does its hackreaction part^. Also, the trace is not in general zero. Both of 
these features are in contradiction with GW’s claims, specihcally their Theorems 1 and 
2 [9,12], considered in the Newtonian limit. We shall come back to the energy conditions 
in subsection 5.2. 

Note that the magnitude of hackreaction depends on the scale of averaging. In 
particular, already in Newtonian cosmology it is signihcant on intermediate scales, below 
the cosmological homogeneity scale^, but well above the scales of “black holes or neutron 

^ This latter condition would artificially select the class of models for which peff — {q)t> — I* 

£»eff - (g)-!) +PeS>0 hold. 

^ E.g., in a domain with a radius of lOOMpc today and initially one-cr fluctuations, the density 
parameters deviate from their homogeneous values of an Einstein~de Sitter background by 15% [29]; 
for estimates in relativistic cosmology including a background with cosmological constant see [32]). 
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3.2. Traces of hackreaction in relativistic cosmology 

Since the trace part of hackreaction cannot turn on just in the Newtonian limit, 
hackreaction cannot be trace-free in general in GR either without violation of at least 
one of the theory’s principles. This remark makes the existence of trace parts clear, since 
the calculation in the Newtonian framework is unambiguous. Green and Wald emphasize 
in their abstract that “Newtonian cosmologies provide excellent approximations to 
cosmological solutions to the Einstein equations (with dust and a cosmological constant) 
on all scales.” If that is the case, then—especially in view of their “dictionary” [10]— a 
claim of trace-free hackreaction cannot be correct. 

General Relativity adds the new feature that this hackreaction term (represented by 
extrinsic curvature invariants here) is coupled to the averaged scalar curvature (77) [31], 

{Qv^v) = 0 . (3) 

This coupling is important for hackreaction in relativistic cosmology, as it can also affect 
the global geometrical properties. This latter is impossible in Newtonian cosmology and 
it is also suppressed in quasi-Newtonian relativistic perturbation theory with periodic 
boundary conditions due to the fact that a conservation law for the intrinsic curvature 
holds. 

3.3. The idea of Green and Wald and its realisation 

Green and Wald assume that the actual metric has the structure: 

9ab 9ab{0) + ; (4) 

where 5 ^( 0 ) (denoted by g^^'^ in GW) is a background metric assumed to provide the 
averaged background modelling the Universe (the indices are spacetime indices). Green 
and Wald stress that in most of their computations 5 ^( 0 ) can be any given background 
metric. 

Garrying out computations in such a general framework (and in particular trying to 
interpret them geometrically) is a daunting, if not impossible, task. We are not aware 
of any similar analysis in Riemannian geometry (where things are dehnitively easier 
than in the Lorentzian case). Hence, Green and Wald tackle a less ambitious task by 
considering a perturbative formalism where the symmetric tensor 7 , representing the 
deviations from the background, is small. However, they do not assume that the hrst 

^ We note that there are two cases in Newtonian cosmology in which this hackreaction term vanishes, 
both being the result of the flat geometry in Newtonian theory: first, it vanishes for spherically 
symmetric solutions (the content of Newton’s “theorem of the iron spheres”) [29] and, second, for scales 
where commonly periodic boundary conditions (a 3—torus topology) are imposed on the deviations off 
a homogeneous solution (a topological constraint as in the steel ball model) [28]. Thus, hackreaction 
vanishes globally, but not in the interior of this simulation box where it is significant on smaller scales. 
However, if we additionally perform a statistical average over many realisations (not considered by 
GW), this term vanishes in the statistical average due to the global constraint [29]. 
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derivatives of 7 are small and even allow second derivatives of 7 to be nnbonndedly 
large, to deal with very large flnctnations in cnrvatnre aronnd the ^'(O) backgronnd. 

We point ont already here, with detailed explanations in Appendix B, that the 
assnmption in Eqnation (4) together with the non-pertnrbative implications for the 
metric derivatives ontlined above, inherits similar problems to those we explained for 
the steel ball analogy. Their statement that “In snmmary, the geometry of Sphereland is 
described by a metric of the form qab{0)+Sab, where q'ab(O) is the metric of a perfect sphere 
and I Sab I ^ kafe( 0 )|, bnt hrst derivatives of Sab are not small, and second derivatives of Sab 
may be enormons. In an exactly similar manner, the spacetime metric of onr Universe 
takes the form pab = Pabi^) + Jab, where Pab{0) has the FLRW symmetry ...” [12] is 
problematic. Modelling the pertnrbation jab with the same nnderlying rationale as that 
for the pertnrbation Sab of the steel ball example implies that the potentially nnbonnded 
second derivatives of the metric may give rise to curvature distributional defects in the 
weak-limit, cf. Appendix B, that cannot be dismissed, since they are as crucial as the 
curvature defects in the steel ball example. 

The scheme adopted by GW adds a stress-energy tensor to a scheme developed by 
Burnett [33] for vacuum spacetimes, which replaces the averaging operation in Isaacson’s 
approach [34,35] by a weak-limit procedure. (Burnett [33] hnds a trace-free effective 
stress-energy tensor by applying a weak-limit scheme to gravitational waves, which 
are physically trace-free. He also discussed a non-vanishing stress-energy-tensor, but 
for the case of electromagnetic radiation, which is again physically trace-free.) For 
another calculation of backreaction in the weak-limit of gravitational waves emitted 
from a Schwarzschild black hole see Ref. [36]. We shall demonstrate that the weak-limit 
operation does not implement a true averaging procedure, and that the assumptions 
made by GW are too restrictive to allow for backreaction. 

3.4- The weak-limit scheme 

Green and Wald’s weak-limit scheme is technically realised, as in the familiar 
perturbation formalism for GR, by assuming that there is a class of coordinate systems 
{x“} and a one-parameter family of Lorentzian metrics (0,1] 3 A 1 —> 5 '(A), such that 
we can locally write the metric components PabW as 

(0, 1] 3 A I- y Pab{^) = gab{0) + 7ab{^) ■ (5) 

These metric components are further required to satisfy the following assumptions: (cf. 
(z)-(in) of GW): 

{i) For all A G (0,1] one assumes that the Finstein equations hold, 

GabigW) + Afi'ab(A) = SnTabiX) , ( 6 ) 

where Tab (A) is a one-parameter family of stress-energy tensors that locally satisfy the 
weak energy condition. 
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{ii) There exists a smooth positive function Ci{x) on (M, ( 7 ( 0 )) such that we may write^ 

| 7 ab( 2 :, A)| < \Ci{x) . (7) 

This is the usual way of controlling the perturbing tensor 7 in standard perturbation 
theory, implying that, as A \ 0, g{\) approaches (/(O). 

{Hi) There exists a smooth positive function C 2 {x) on (M, ( 7 ( 0 )) such that we may write 

|Vc 7 afe(a;, A)| < C2{x) , ( 8 ) 

where V denotes the covariant derivative w.r.t. (M, 77 ( 0 )). This requirement does not 
imply that the hrst derivatives of 7 are small and they do not necessarily become small 
as A \ 0. 

{iv) Finally, they assume that there exists a smooth tensor held fVabcdef{x) on (M, ( 7 ( 0 )) 
representing the weak-limit of the product of hrst covariant derivatives V 7 (A)V 7 (A) as 
A \ 0, he., 

W-lim \Va7cd{x, X)Vb^ef{x, X)\ = piabcdef{x) • (9) 

A ^0 

Here, the weak-limit is intended in a strict weak sense, he., requiring that the functions 
k-abcdef are locally summable (see GW, Eqs. (5) and ( 6 )). (It should be noted that this 
is more restrictive than the usual distributional interpretation of the weak-limit, where 
Habcdef could be represented by a singular distribution. We examine this issue in depth 
in Appendix B). This condition forces good behaviour of the potentially problematic 
quadratic product V 7 (A)V 7 (A) as A \ 0. 

The condition {iv) is not motivated by any transparent and natural physical 
argument: as it stands we do not consider {iv) to be a robust hypothesis underlying 
backreaction analysis. Even for the high-frequency limit regime of gravitational waves 
there are physically transparent alternative approaches to this that do not need to 
assume condition {iv), see [37], Ch. Ill, paragraph 12 and Ch. XI, (this reference is also 
interesting for the issue of gauge dependence, discussed further below), and also [38-40] 
and references therein. 

3.5. Green and Wald’s one-parameter family of metrics and metric flows 

A delicate issue with GW’s perturbation approach is the validity of assumption {iv) 
together with assumption (i) that the Einstein equations are required to be satisfied 
along the curve of metrics, for all X except in the limit A \ 0. Thus, backreaction is 
supposed to turn on just in the limit (see subsection 3.7 for details). This situation will 
turn out to be problematic. 

We hrst point out an issue that arises in taking due care of the subtleties generated 
by the potentially unbounded second derivatives of the metric huctuations. There may 
be particular spacetimes for which their procedure can be carried out, but typically 
one would get distributional curvature tensors under the GW hypotheses in the limit 

^ Green and Wald use a fixed A-independent Riemannian metric for computing norms of the tensor 
field involved and of their covariant derivatives, cf. Equations (5) and (6) of GW. 
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where A \ 0, which makes their effective stress tensor tab(O), cf. subsection 3.7, just 
a formal expression, not containing functions but distributions, which are not easily 
interpreted from the geometrical and physical point of view. In particular the trace- 
free conditions would generally be false. Even it were possible to get rid of any non¬ 
regular distributional part, as GW implicitly assume, their approach strongly relies 
on a very delicate non-uniform boundedness (in A) hypothesis on |VcVd 7 afe(ir, A)| (see 
the discussion surrounding Eq. (B.19)). This feature is imported from Burnett’s high- 
frequency limit in gravitational wave theory, and it is precisely this non-uniformity that 
generates a backreaction only in the A \ 0 limit. However, whereas in gravitational 
wave theory a non-uniform high-frequency limit is not a surprising feature of oscillating 
phenomena, in cosmology it does not correspond to the physical situation. In the 
real Universe, the matter distribution does not oscillate on arbitrarily small scales, but 
there is a hierarchy of finite averaging length scales that are physically dehned by the 
gravitational systems we want to average. It is important to stress that if we remove 
this specihc non-uniformity requirement from the GW formalism, for example instead 
requiring the second derivatives of 7 to be uniformly bounded in A, then the effective 
stress-energy tensor tab{0) vanishes. We give the proof in Appendix B. Moreover, in 
a real averaging procedure, backreaction terms must already be present for non-zero A. 
We shall explain this remark in the following two subsections. 

We also note that known and well-controlled metric flows as investigated in the 
mathematical literature on cosmological backreaction are generally not required to 
satisfy the Einstein equations during metric deformations. Metric flows are typically 
(weakly) parabolic and, hence, in general they do not commute with the Einstein 
evolution which is (weakly) hyperbolic with elliptic constraints; ‘weakly’ here refers 
to the fact that these flows must comply with the appropriate form of equivariance 
under the action of the diffeomorphism group (see Ref. [41]). Geometric flows are often 
gradient flows (with respect to some Hilbertian positive dehnite inner product), whereas 
the Einstein evolution is Hamiltonian. In particular, in a cosmological setting, rescaling 
flows (such as the Ricci-Hamilton-Perelman flow) work in a 3D hypersurface, rescaling 
the metric in the direction of the spatial Ricci tensor (see Ref. [42]), which a priori is 
unrelated to the Einstein flow that deforms the initial data set along the (symplectically 
conjugated) linearization of the Einstein constraints in the direction of the lapse and 
shift. In other words, the former relate to flows in space and the latter to time evolution. 

3.6. Averaging without averaging? 

While Green and Wald aimed at formulating the weak-limit formalism in a 
mathematically “completely general” manner, they noted that “it would be extremely 
difficult to formulate mathematically precise criteria for the validity of applying our 
formalism to a given spacetime”. They argued that they simply “resort to plausibility 
arguments to obtain [their] conclusions” regarding the applicability of the formalism 
to the real Universe [9]. They then argued that it is plausible that the weak-limit 
corresponds to spacetime averages over local regions, provided that the size of the regions 
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corresponds to the homogeneity scale, which is much smaller than the curvature scale 
of the background universe. However, arguments about limits that may be plausible for 
smooth functions do not immediately translate to singular distributions. 

Formally, GW are not averaging the fluctuations represented by the tensor 7 . They 
consider the weak-limit in a singular perturbation scheme for GR around a FLRW 
spacetime (or whatever geometry g{0) represents). In their words (GW): “The notion 
of ‘weak-limit’, which appears in assumption {iv), corresponds roughly to taking a local 
spacetime average, and then taking the limit as A \ 0”. This is not averaging in the 
sense used in discussing backreaction. The weak-limit of a sequence of smooth functions 
is typically a (singular) distribution. An example (on a compact Riemannian manifold) 
is provided by the heat kernel: K{x, y]t), t > 0. This is for all f > 0, so smooth that 
its convolution against nasty functions tames all their defects. Nonetheless the weak- 
limit of K{x,y,t), as f \ 0, is a singular distribution, the Dirac measure 6y supported 
at the point y. The operation might correspond to averaging if the compactly supported 
function that GW used to dehne the weak-limit had a form of translation symmetry 
built in (say, 0(x — y) in M"', or if we consider a large symmetry group, as in FLRW), so 
that one is actually regularising (the distribution) via a convolution product, not just a 
generic weak-limit. 

Green and Wald consider a singular perturbation theory around FLRW, not 
averaging or taking convolutions. This strategy is quite common in quantum held 
theory (QFT) where one needs to tame the spectrum of huctuations of a quantum held 
around a given background conhguration and describe how these huctuations may dress 
the bare background®. 

Dismissing the potential distributional nature of curvature huctuations in such a 
setting is akin to the situation we encountered in the steel ball example. The analogy 
is (i) here the background metric (M, (/(O)) plays the role of the almost-everywhere hat 
background metric in the steel ball model; and (ii) the distributional part plays the role 
of the physical curvature huctuations described by the conical vertices in the steel ball 
model. As in that case, there is no reason why they should disappear when performing 
a real averaging procedure. (In the two-dimensional steel ball model the Gauss-Bonnet 
theorem implies that they cannot disappear.) 

Glearly, there is nothing wrong in addressing the backreaction issue by studying a 
singular perturbation scheme around FLRW as GW do. It is a potentially useful idea. 
After all, in relativistic cosmology, unlike the averaging of scalars (which we will come 
to below), the averaging of tensors is not yet well-established, at least in a rigorous 
sense. The real point is to understand what you can get out of such an approach, how 
it is technically realised, and how it can be physically interpreted. 

We hnally remark that, even if not gauge invariant, a singular limit may be given 
physical sense locally, e.g., where Newtonian spacetime is obtained as a singular limit 
of a sequence of Einstein spacetimes (dehned by appropriate initial data expressing the 

® For the difference between dressed and bare backgrounds in cosmology see Ref. [43]. 
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3.7. Backreaction from no hackreaction? 

Even if one were to extend the GW perturbation scheme to rigorously calculate the 
trace from the distributional curvature using the procedure outlined in Appendix B, 
there is still another issue: Green and Wald assume the Einstein equations to hold for 
all A, except for A = 0. This is shown in their assumption (i), recall Equation (6): 

Gabigi^)) + ^9abi^) = Stt Tab(A) ; A G (0, 1] , 

and in their terminology for the limit (GW, Equation (1)): 

Gabigi^)) + Agab{0) = StT {Tab{0) + tab{0)) . 

Green and Wald refer to the averaging and fitting problems as they were raised in 
Ellis [17] and Ellis and Stoeger [18] (GW’s Figure 1 is reproduced from [17]). Those 
references emphasise that averaging or smoothing the inhomogeneous metric and taking 
derivatives (and their products) are in general non-commuting operations, which implies 
that, in general, the Einstein equations do not hold at any smoothing level'. In other 
words, although the actual model obeys the Einstein equations, there need not be a one- 
parameter family of solutions approaching the background that also obey the Einstein 
equations. This issue lies at the basis of backreaction, and the first-principle argument 
of [17] can be realised, e.g., with the scalar averaging formalism discussed in Section 5, 
or (directly related to a smoothing procedure) with Ricci flow techniques [46]. 

Putting this argument formally and adopting GW’s notation, an averaging 
operation, however it is defined (and here specialized to a one-parameter family of 
metrics), leads to effective terms, symbolically put into the tensor Tab^'- 

GabigW) + Agab{X) = Svr (Tafe(A) + Tafe(A)) ; A G (0,1) . (10) 

Green and Wald correctly quote the starting point of Ellis’ argument [17] that the 
Einstein equations are assumed to hold in the unaveraged case (A = 1), but they then 
assume that the Einstein equations also hold for A < 1 down to the limit, where suddenly 
the backreaction term tab turns on®. 

^ See [45] for a summary and examples of non-commuting operations. 

® This term is interpreted as an effective stress-energy tensor, although it arises from smoothing the 
geometrical side. 

® In their recent note in response to the draft of this paper [22] Green and Wald claim that “backreaction 
in our [GW’s] formalism does not suddenly ‘turn on’ at A = 0”. This is in disagreement with the 
requirement that the Einstein equations are assumed to hold for all A 7 ^ 0 and the fact explained in 
the previous subsection that no averaging is involved. The statement that “... the dynamics of ( 7 a&( 0 ) 
accurately describes the (large scale) dynamics of gab{N) sufficiently small A” does not take into 
account the lack of continuity of the mapping gabW Gab(A), as we explain in Appendix B. Moreover, 
the claim that “for A > 0, backreaction terms are present, and are described to leading order by the 
second order Einstein tensor” is not to do with backreaction arising from averaging, but the difference 
Gab(A) — Gab(O) (see Eqs. (9)-(12) in [9]). If such terms were backreaction terms, then in gravitational 
perturbation theory every term of order higher than 0 should be considered as a backreaction term for 
the assumed background. 
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In the framework of the weak-limit scheme, starting from (10), the calculation of 
the weak-limit gives: 

+ w-lim 

A\0 

where the last term is the weak-limit of the A—dependent backreaction term, which is 
absent in GW’s calculation. We will give an argument in subsection 4.2 and a proof in 
Appendix B that the effective stress-energy tensor tab{0), the trace-free “backreaction 
term” of GW, is expected to vanish for a uniform convergence to the background metric. 
We now illustrate the above issues in terms of examples. 

4. Examples using the GW framework 

Green and Wald provide an example of a family of vacuum spacetimes in [11] (section 3), 
discussed in subsection 4.2 below. In this and our other examples we require, as before, 
Pabix, A) to be a solution to the Einstein equations for all A > 0 (GW’s assumption (i)), 
and, as A \ 0, we require convergence of gab{x, A) to a background metric gab{x, 0). 

We refer the reader also to Appendix G, where we comment on the example provided 
by Szybka et ah [47] and on another example by GW that they provided in [ 11 ] 
(section 4), both of which aim at including matter inhomogeneities. We are not aware 
of any example satisfying the GW conditions that does satisfactorily include matter 
inhomogene it ies. 

We hrst start with a classical example given by Geroch to demonstrate the 
coordinate-dependence of the limit of a one-parameter family of metrics. 

4-1. Coordinate-dependence of the GW framework: Geroch’s example 

It was emphasized by Geroch [48] that, if we restrict our attention to metrics 
parametrised by some A only, then the limit is not uniquely dehned. 

Geroch considers the family of parametrised Schwarzschild metrics: 

ds2 = (^1 - df2 -(^1- dr^ - r\de^ + sin^ ed<f^) , (12) 

where A = In this form there is no limit as A \ 0. Gonsider, however, the 

coordinate transformation: r = Ar, t = X~^t, p = \~^9. Then, our family of metrics 
takes the form: 

ds2 = (^A^ - dP - (^A^ - ^ dh^ - h\dp^ + A-2sin2(Ap)d02) , (13) 
and the limit exists - it is the Kasner metric with Kasner exponents —1/3, 2/3, 2/3: 

ds^ = —^d? + -dr^ — r^{dfP + p^d0^) . (14) 

r 2 

However, applying a different coordinate transformation, x = r + AqA"^, p = AqA^^^ 
(for an arbitrary constant Aq) to our original metric ( 12 ) and taking the limit A \ 0 
yields the flat Minkowski metric. 



Gab{g{d)) + A5f(0) — Svr ( Taf,(0) +tafe(0) 
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Geroch’s discussion illustrates that the outcome of the weak-limit procedure 
depends on the choice of coordinates. Moreover, it can be argued that instead of hxing 
the background metric, it should emerge from a coordinate-independent averaging or 
rescaling procedure. We hnally remark that the characterization by curvature invariants 
can be used to clarify Geroch’s example and avoid the coordinate dependence of his 
methods (see Ref. [49]). Such ideas could work whenever there is some identihable way 
of taking a weak-limit of the set of curvature invariants. 

The issue of gauge dependence has been addressed in Ref. [3]. A related discussion 
can be found in [50]. In this latter paper it is demonstrated that even the spacetime 
integrals of 4D scalars are in general gauge-dependent if the domain of integration 
is hxed. The domain should rather be dehned intrinsically: the behaviour of the 
domain under gauge transformations should compensate the effects of the same gauge 
transformations applied to the integrated object and, thus, the domain must be 
coordinate- and A-dependent (the latter requirement comes from the parametrisation 
of the metric and its volume element). Given such an x- and A-dependent domain, it 
would no longer be allowed to exchange the limits and the integration operators as is 
done by GW in the case of hrst derivatives of metric deviations. This will then imply a 
non-commutativity of the two operations and would give rise to backreaction terms (for 
the consequences of non-commutativity see Ref. [31] for the Einstein flow. Ref. [46] for 
three-dimensional metric flows, and Ref. [45] for a general discussion). Then, the weak- 
limit operator would match the authors’ expectations of hrst performing an average and 
then taking a limit. This A—dependence has also been emphasized in a recent paper by 
Kopeikin and Petrov [51]. 

We move now to another issue. As pointed out by Szybka [47], one of the 
key ingredients of the GW formalism is Burnett’s restriction that any A-dependent 
coordinate transformation must be reduced to the identity when A \ 0. This is essential 
and shows that the procedure cannot work for arbitrary A—dependent coordinate 
transformations and is therefore not covariant. The Geroch coordinate transformations 
would be excluded with this condition. However, even if we take such “allowed” 
coordinate transformations we have a path dependence in the weak-limit procedure 
as discussed in the next subsection. 

4-2. Path-dependence of the weak-limit procedure 

In Ref. [11] Green and Wald look at a subclass of vacuum Gowdy metrics on a torus 
with metric 

ds^ = + dt?’) + e"" (e'’d(T=‘ + e-''d(5=‘) , 


(15) 
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which depends on the function P{t, i9). A one-parameter family of metrics parametrised 
by a discrete parameter is chosen as follows: 

Pi, = ^ Jo(iVe-") sm{Nd) , (16) 

so that, like Asin(a;/A), for larger and larger N (corresponding to smaller and smaller 
A) it oscillates with shorter and shorter wavelength and smaller and smaller amplitude; 
the limit N ^ oo corresponds to A \ 0. 

The weak-limit gives a non-zero result only for functions such as f{x) = a;sin(l/a;) 
(as in example (4.2) of [11]), which have a singular limit as they oscillate ever more 
finely without limit as a: —?• 0. Any non-zero result depends essentially on taking the 
infinite limit of those oscillations as the homogeneous model is approached; it will be 
zero if they cease at any value of A, no matter how small. This is what is called in [3] 
an ultra-local limit, the oscillations must not even stop at atomic scales. 

But the path chosen to link gab{x, = 1) to Qabi^, 0) is arbitrary: there is no reason 
why the specific path of Eq. (16) is preferred over others. One might equally well choose 

Pn = or Pn = sm{Ni^) (17) 

A 

for example; these are both solutions of the field equations, and these paths will give 
^afe(O) = 0. The physical entity is the endpoint gab{x,N = 1); the intervening family is 
arbitrary, as is the parameter choice. 

The result obtained from Eq. (16) only has physical meaning if it gives the same 
answer for different such parametrisations. This procedure does not. 

One could try 

PN^jNMNe-nMN») , ( 18 ) 

and see for what f{N) one gets tafe(O) 7 ^ 0 as well as the amplitude A/f{N) going to 
zero. It is non-zero only for very special choices of f{N). 

We repeat that the problem is the ultra-local requirement that perturbations in 
the envisaged family continue down to indefinitely small wavelengths without limit; 
otherwise one gets a zero answer. For very small distances in realistic cosmological 
modelling the solutions should behave like {A/N) rather than like (A/A) sin(A'd) 
(equivalent to using /(x) = x for very small scales as x —)• 0 rather than xsin(l/x)). 
Then, the quantity tafe(O) will be zero. Thus, Equations (2.7) and (2.8) of [11] are true 
in realistic situations only because tafe(O) = 0 . 

The pathological nature of the limit is emphasized by the fact that in the example 
given, for each N < 00 , the metric is a solution of the vacuum field equations, but 
the limiting metric (3.11) of [11] is not a solution. The backreaction term turns on in a 
delta-function way, in the exact limit only. Genuine backreaction effects should not have 
a delta function discontinuity of this sort: the backreaction term should be non-zero for 

The restriction to integer N is related to a topological assumption, but the solutions are still solutions 
if N is replaced by a continuous parameter 1/A, although they cannot then be 3D spatially toroidal. 
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every metric where A > 0 in any realistic seqnence approaching the backgronnd, as we 
have already pointed out. In the real Universe, inhomogeneities have a hnite amplitude 
and length scale, which should correspond to some A = Aq > 0. One could use a Taylor 
series to estimate the effect at A = Aq from the data at A = 0, but that will not work 
for this singular family of perturbations. 

These examples demonstrate that tab(O) should vanish for a realistic sequence of 
cosmological perturbations, which should be cut off at a finite wave-length before the 
amplitude goes to zero, inter aha because the Sun exists; real perturbations do not 
continue to zero wavelength. Indeed, as we show in Appendix B, the quantity tab(O) is 
only non-vanishing if we impose further restrictions beyond the conditions imposed by 
GW and, especially, if we require non-uniform convergence so that this term only arises 
in the singular limit (see the discussion surrounding Eq. (B.19)). It is worth noting 
that any uniformly convergent limit of a family of spacetimes will inherit some of the 
properties of the family. These properties are called hereditary [48]. A set of vacuum 
solutions with a non-vacuum limit would not respect Geroch’s hereditary properties for 
limits, reflecting the points made above about the GW scheme being singular. 

4 . 3 . An example by Korzyhski 

We will not list here the many investigations of models and exact solutions illustrating 
or quantifying backreaction effects. These works may be found in the reviews mentioned 
in the introduction. We will only point out one recent work by Korzyhski on nonlinear 
effects from multi-scale structure [52] because it closely follows the arguments of GW, 
it precisely illustrates why their arguments do not apply, and it quantihes the GR 
inhomogeneity effect with the help of an exact nested structure model. 

Korzyhski shows that it is possible for small local deviations to produce a large 
global effect. The procedure is closely related to the idea of metric smoothing by 
removing density ripples. His result is not a counter-example to the claimed results 
of GW, because, as he states, his model violates GW’s assumption (ii), according to 
which perturbations should be small not just locally, but with reference to a single 
global background. In Korzyhski’s example, the smoothing procedure picks up effects 
from all intermediate scales, arising as a cumulative effect, and relies on the “depth” of 
structure, i.e., on the ratio between the homogeneity scale and the scale of the smallest 
ripples (recall from subsection 4.2 that no lower cut-off exists for the inhomogeneities 
in the GW formalism). Korzyhski’s example thus illustrates the non-local nature of 
backreaction. In line with this he also has shown in Ref. [53] that backreaction in 
a system of many compact sources can be large even if the metric is close to FLRW 
almost everywhere. The important issue there is the clustering properties of matter. 
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The scalar averaging approach [31,54,55] referred to here is a realisation of the thoughts 
advanced in Refs. [17,18] (for other approaches, see Refs. [46,56-59]). In this section 
we address some misinterpretations of the scalar averaging framework that appear in 
Section 3 of [12]. 

There are three points that must be considered: (i) the difference between spacetime 
notions versus a cosmological 3 + 1 approach, (ii) the difference between the scalar 
averaging framework and metrical approaches to backreaction, and (hi) the importance 
of coordinate-independent arguments. 

Green and Wald also comment (in footnote 6 of [12]) that this approach has the 
difficulty that the system of averaged equations is not closed. This (which we believe is 
the only relevant issue in Section 3 of [12]) has to be emphasized: even if one included 
a further evolution equation for the backreaction variable, as GW suggest, the system 
of averaged equations would not be closed, unless a dynamical equation of state is 
assumed or derived [60]^^. This issue has been spelled out at various places, e.g., 
it was already noted in the original paper [31]. A closure of the system would be 
equivalent to saying that by shrinking the domain of averaging to arbitrarily small 
domains the equations would provide a general local solution of Einstein’s equations. 
Rather, these are conditions on averages similar to, e.g., the tensor virial theorem of 
Ghandrasekhar and Lee [62] that also provides a balance relation between averaged 
variables by including fluctuations^^. 

We start with a common misunderstanding that also appears in [12]. 

5.1. Can there be average acceleration with local deceleration everywhere? 

Yes, because backreaction is non-local. 

Green and Wald state, and this opinion is shared by others since it seems plausible, 
that: “... One can give an example of this sort [13] wherein the ‘backreaction’ is so large 
that one obtains acceleration of the representative FLRW universe, even though each of 
the [disjoint]^^ components of the actual Universe is decelerating”. (Our emphasis). 

A second-order differential equation for the backreaction variable Qt> (as the relativistic counterpart 
to Equation 1) can be derived (T Buchert, unpublished), and it depends on further variables such as 
the intrinsic curvature backreaction appearing in metrical rescaling [46], implying the need to derive 
further equations for these new variables. This situation is akin to the (infinite) moment hierarchies as 
they appear, e.g., in kinetic theory. A recent paper addressing closure illustrates this issue [61]. 

The tensor virial theorem is closed by the virial conjecture of stationarity of the averaged inertial 
tensor for an isolated system (a stationarity hypothesis can also be used in the scalar averaging 
framework, see Ref. [63]). By construction, such balance relations cannot provide the local solutions. 

Literally, GW state “disconnected” and refer to Section 3 of [13], which concerns “a model where 
at time t the Universe consists of two disconnected(!) dust filled FLRW models... Taken literally, 
topologically disconnected FLRW universes are physically less relevant. Here, we discuss the more 
relevant case of disjoint domains whose union constitutes the whole spatial section. This remark 
applies to further citations from GW as in subsection 5.3. 
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The physical effect under question has often been explained in the literature (see, 
e.g., Refs. [64], [5], subsection 5.2): the fraction of the volume of faster expanding regions 
grows more rapidly, so the average expansion rate rises. Formally, we see this by looking 
at the Raychaudhuri equation that governs the local expansion rate of dust matter: only 
a positive cosmological constant and vorticity can lead to an acceleration of the local 
expansion rate 6 (in the sense that the time-derivative of 6 is positive). Putting the 
cosmological term and the vorticity (which is active on small scales only) to zero, at 
first sight it seems implausible that a collection of such decelerating fluid elements can 
lead to acceleration of some patch of the Universe. Let us look at the difference between 
the local and the volume-averaged Raychaudhuri equation for irrotational dust and 
vanishing cosmological constant: 

0 =-dvrG^ + 2II - R ; {ey^ = -A'KG{Q)^ + 2{\\)^-{\)l , (19) 

where we defined the principal scalar invariants of the expansion tensor 0jj, 2II : = 
2/36*^ — 2(7^ and I := 0; averaging is performed on a spatial domain T>, and we have 
used the commutation rule: {9)f, — {6)j) = (9‘^)v — (9 )t> — ((^ “ (6 ')d)^)d. Clearly, by 
shrinking the averaging domain to a point, the two equations agree. However, evaluating 
the local and averaged invariants, 

2II - R = -1^2 _ 2^2 
3 

2 (II)^ - {1)1 = {e)l - 2 (cr)J + l{{e- {e)v)X {^)v)X > ( 20 ) 

gives rise to two additional, positive-definite fluctuation terms, where that for the 
averaged expansion variance enters with a positive sign. 

Thus, the time-derivative of an averaged expansion may be positive even if the 
time-derivative of the expansion is negative at every point in V. This is, technically, a 
consequence of the non-commutativity of averaging and time-evolution and, physically, 
of the non-local nature of averaging that takes correlations into account. Applying this 
fact to a model that is the union of disjoint FLRW submanifolds with spatial boundaries 
implies that the difference in expansion rates of the respective sections is a positive- 
definite expansion variance term (see references to such models in subsection 5.3). 

5.2. Can energy conditions be violated for the average dynamics? 

Again, the answer is yes. It is a possible consequence of what has been said above. 
Green and Wald state a theorem that the effective energy momentum tensor ta6(0) has 
to obey the weak energy condition (see, e.g.. Ref. [11], Eq. (2.8) in Theorem 2), where 
it is written in the abstract that “the leading effect of small scale inhomogeneities on 
large scale dynamics is to produce a trace-less effective stress-energy tensor that itself 
satisfies the weak energy condition”. Green and Wald present in Ref. [11] (section 4) an 
example for backreaction violating the weak energy condition (where they imply that 
this must be a consequence of the violation of the local weak energy condition), and 
where “the limiting metric has an effective stress-energy tensor which is not trace-less”. 
We comment on this example in Appendix G. 
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Green and Wald “emphasize the importance of imposing energy conditions on 
matter in studies of backreaction”, but we emphasize that the effective stress-energy 
tensor may, in general, violate energy conditions: the non-local fluctuation terms 
discussed above may lead to violation of energy conditions upon averaging, although 
locally they are satished. 

This has also been stressed in the literature with regard to the mapping of 
inhomogeneity effects to an effective scalar held that plays the role of a quintessence 
held in standard models for dark energy (see, e.g.. Refs. [1] and [30]). While it is known 
that a fundamental scalar held (e.g., a quintessence held) must violate the weak energy 
condition and other physical properties (e.g.. Ref. [65]) to explain observational data, an 
ehective scalar held need not. Since, as we have shown in subsection 3.6, the framework 
in which the problem is addressed by GW is local, the resulting ehective stress-energy 
tensor would then have to obey the local energy conditions only, and only in the limit 
A \ 0 (recall that there is no ehective stress-energy tensor in the GW scheme for 
A G (1,0)), i.e., their Theorem 2 does not apply to an averaged stress-energy tensor. 
Addressing the real averaging problem would not deliver a local expression. We remind 
the reader that there is currently no agreed way to average tensors in GR. 

Green and Wald’s theorem states that the ehective stress-energy tensor emerging 
from a weak-limit of spacetimes obeying the Einstein equations has to obey the weak 
energy condition. Such a result would be hne but irrelevant for backreaction. 

5.3. Confusion between the scalar averaging framework and metric approaches 

Green and Wald question the framework where backreaction is discussed in terms 
of spatial averages of scalar quantities. They assume that such a procedure assigns 
an averaged FLRW metric to the averaging region, and the “main haw with such 
approaches” is that the metric thus obtained may be far from the real metric, even 
when the latter is close to a FLRW metric, generating “entirely spurious” backreaction 
terms [12]. 

However, this criticism is based on a fundamental misinterpretation of the scalar 
averaging formalism [31,54]: it does not involve any notion of average metric (and does 
not refer to geodesic deviations [22]); only averaged scalars are considered. 

Explicitly, Green and Wald criticise the scalar averaging approach by considering 
two disjoint^^ dust FLRW universes in diherent stages of expansion. In Ref. [12] they 
state “The Buchert prescription would represent this [disjoint] Universe as a single 
FLRW Universe, which provides a bad approximation to the actual metric everywhere.” 
And: “...the Buchert procedure with the above choice of hypersurface instructs us 
to represent the Minkowski metric with an ‘averaged’ FLRW metric 5^(0) that is an 
extremely poor approximation to gab on all scales.” Again, we emphasize that in the 
scalar averaging framework there is no such “instruction” on a metric, apart from the 

As stated above, we consider this property rather than the original term “disconnected”. We also 
interpret “dust FLRW universe” to refer to a spatially bounded submanifold of a dust FLRW universe. 
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spacetime split of the 4-metric (on which we comment below). As explained at the 
beginning of this section, the formalism dehnes spatial average properties among scalar 
variables that depend on second derivatives of the metric. The metric itself is not 
specihed. 

We note that there are indeed investigations in the literature that study so-called 
template metrics [57], [66], [67], [68], [69], that are intended to be compatible with 
the exact average properties (although these are not the papers referred to by GW). 
However, even these papers do not “represent this [disjoint] Universe as a single FLRW 
Universe” (the metric which the standard model exactly assumes). On the contrary, 
backreaction effects are often illustrated on the basis of models with two disjoint universe 
sections and the main effect is identified as being the result of the differences in expansion 
rates (see, e.g. [70], [71], [72], [68]). 

The FLRW metric itself can be viewed as a global template metric. If we hnd that 
the global average spatial curvature today is not zero, then the flat FLRW template will 
constitute a poor approximation of the spatial sections, and small deviations thereof 
will represent large deviations with respect to the physical background [73]. This is 
exactly the problem that a template metric, even a single global template, is supposed 
to correct for. Moreover, as we argued in the context of the steel ball model analogy 
and the weak-limit framework: curvature inhomogeneities are not required to average 
out on an assumed background, as GW a priori impose to be true. 

Green and Wald also argue that the average expansion rate {6)^^ is not physically 
meaningful, and that deviations of {6)^^ from the FLRW value due to large Qv are 
“entirely spurious”, and that “In realistic cosmological situations [...] the comoving 
synchronous hypersurfaces of the Buchert construction will provide a poor choice for 
approximating the hypersurfaces with nearly FLRW symmetry”. (Gompare here our 
discussion of the backreaction term Qx> as it is evaluated in Newtonian cosmology in 
subsection 3.1.) 

It is well-known that the average expansion rate depends on the choice of 
hypersurface, and the issue has been discussed at length in the literature, where it 
has been argued that the physically relevant averaging hypersurface is the one of 
statistical homogeneity and isotropy, see e.g. [5,7,56,64,74-80]. It is irrelevant that 
there are hypersurfaces that are not physically interesting, it only matters that averages 
on some hypersurfaces give physically meaningful results and can be formulated in a 
covariant way. (For discussion of covariance and gauge-invariance of scalar averaging, 
see Refs. [50,81,82].) The average expansion rate evaluated on some hypersurface is a 
useful quantity so far as it gives an approximate description of what is observed, which 
is indeed the case for the hypersurface of statistical homogeneity and isotropy. However, 
for realistic situations, the difference of averages taken on hypersurfaces of statistical 
homogeneity and isotropy from averages taken on the hypersurface generated by the 
fluid’s rest frame and measured by observers comoving with the matter component of 
the Universe, well modelled by dust, is expected to be negligible [79], but this should still 
be demonstrated in more detail. A nontrivial example is provided by the Swiss Gheese 
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model of [83], where the average expansion rate describes the redshift (and to a lesser 
extent the Inminosity distance) well, even in a solntion that is far from the FLRW case. 
We emphasize that, in practice, observations of qnantities snch as the expansion rate 
and density always involve spatial averages (or averages over null geodesics, which can 
be related to spatial averages [79,80,83,84]). Averaging is not a mathematical artifact, 
it is a feature of real observations that has to be properly modelled. 

5 . 4 . Coordinate dependence of Green and Wald’s arguments: the comparison of metrics 

The averaged metric may be a poor approximation to the 5^(0) of the GW formalism 
(recall Eq. 4). Even though the averaging formalism does not refer to an explicit 
metric. Green and Wald argue that this “poor approximation” is an issue. But it 
is misleading to base an argument on the fact that the metric coefficient functions 
are “far from” the values of a background metric. It may be demonstrated that 
such an argument is coordinate dependent. Gonsider as an example a flat metric 
and write it in different coordinates by introducing a diffeomorphism x°' = /“(X^) 
such that the metric coefficients can be transformed to the Minkowski coefficients: 
gabdX°- (g) dX^ = (g) dX^ = ® dx‘^ (the last equality uses the inverse 

of the transformation /“, and a vertical slash denoting a partial derivative with respect 
to the local coordinates X“). The values of the coefficients gab = ’^ay be “far 

from” the values of the Minkowski metric 6^, but in fact they are the same metric 
arising from reparametrising a flat space. 

We conclude that any comparison of metrics by just using metric coefficients is 
unphysical, since it depends on the coordinates used. The question of “how far” one 
metric is from another is a geometrical question that might perhaps be addressed 
quantitatively through curvature invariants. Gomparing two metrics in GR is quite 
an involved task: we can employ Gartan scalars to test for isometry of metrics, see, e.g., 
[85,86], but, as Gartan showed, this may require up n(n + l)/2 covariant differentiations 
of the Riemann tensor in n dimensions. (In four dimensional spacetimes, one only needs 
at most seven derivatives [87].) 

Such a test is justihed by Theorem 9.1 of [85] which tells us that the geometry of 
a sufficiently smooth manifold is locally uniquely determined by the curvature and its 
derivatives to some hnite order. But (a) the arguments by GW do not include comparing 
derivatives and (b) we are not aware of any “almost” version of this theorem, i.e. a 
statement that, if in some region the curvature of the metric g and its derivatives are 
close, in a suitable topological space, e.g., some Sobolev space (c/. Appendix B), to those 
of some metric ( 7 ( 0 ), then g is close to 5 ^( 0 ) (again, in some suitable topological space). 
Note in this context (compare subsections 4.1 and 4.2) that GW’s (and Burnett’s) 
assumption {ii), cf. subsection 3.4, is in some coordinate components, so to be useful 
here the missing theorem would have to include something about how the coordinates 
are chosen to get a limit as GW want and the GW prescription would have to say more 
about the derivatives of the curvature. 

Another way to deal with the issue of closeness of metrics for spatial averaging might 
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be an appropriate generalization of the methods used in [88] which allow identihcation 
of the Schwarzschild metric given the data on any suitably smooth hypersurface. 

Finally, we turn to the issue of whether there is any reason to think that the 
standard cosmological model, which is based on work nearly a century old [89, 90] 
(and unlike the standard particle physics model is the simplest conceivable cosmological 
solution), might not be the best description of our Universe. 

6. Models and observations 

An implicit assumption of the FLRW model is that the dynamics and observations 
of the inhomogeneous Universe can be modelled by the dynamics and evolution of a 
spatially homogeneous and isotropic universe model. This procedure seemed to be 
completely adequate for many decades; indeed, unanticipated discrepancies between 
observations and the FLRW model such as the need for an accelerated homogeneous 
universe model [91,92], compare here Ref. [93], are ascribed to a cosmological constant, 
or more generally dark energy with a possible 2 ;-dependent equation of state. It is 
often stated that we are in an era of “precision cosmology” and that all observations 
are consistent with a “standard” cosmological model referred to as ACDM, which is an 
FLRW model with a stress-energy content of several components (cold dark matter, 
baryons, neutrinos, radiation, and a cosmological constant A) htted to agree with 
observations. 

Another popular approach accounting for the apparent acceleration of the Universe 
is to assume the Einstein equations are not a good description of the Universe on 
cosmological scales and that “modihed gravity” models are required to account for 
the observations. 

In this paper we remind the reader of another possibility. We assume that the 
Einstein equations hold locally, but because of the inhomogeneous distributions of 
matter and geometry the FLRW model fails to adequately describe the observations. 
While the FLRW-ACDM model may indeed be an adequate htting model for many 
observations, future cosmological observations that are more accurate may provide 
evidence that the FLRW-ACDM model is inadequate. It may be that some of the 
many “tensions” between observations and the FLRW-ACDM model already constitute 
such evidence: there are several different observational challenges to the FLRW-ACDM 
model at the 3a level [94-102], and some observational questions have been raised 
about the identihcation of the comoving rest frame on 60-100h“^ Mpc scales—well 
above those of strong-held astrophysical objects [103, 104]. These results contrast 
with GW’s statement that the FLRW-ACDM model is in “excellent agreement with 
all cosmological observations”. However, this paper is not intended as a discussion 
of potential observational problems with (nor criticisms of) the standard model, but 
instead it analyses Green and Wald’s theoretical claims. 

We turn now to observational issues in dealing with the inhomogeneous Universe. 
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It is important to take into account large metric derivatives and corresponding changes 
of observational data interpretation. Green and Wald discuss this, underpinned by 
the claim that Newtonian notions fully capture these changes with the help of their 
dictionary, which is restricted to an assumed near-FLRW situation. We agree with 
their statement “Moreover, the time evolution of the shear and convergence of a bundle 
of geodesics depends on the Riemann curvature (i.e., second derivatives of the metric), 
which can be very large”, but it is not clear that a “Newtonianly perturbed FLRW 
model” can fully account for this. This should be regarded as an open question until 
backed by concrete general calculations. The issue is whether the actual GR solution 
or aspects of it can be constructed via a “dictionary” from a Newtonian solution (not 
shown), not whether a quasi-Newtonian GR solution can be constructed via some rules 
from a Newtonian solution. 

Green and Wald argue that a mapping of solutions of Newtonian cosmological 
equations with periodic boundary conditions and certain properties to GR solutions 
given in Ref. [10] “should yield an extremely accurate general relativistic description of 
our Universe, and, in particular, it provides the leading order backreaction effects at 
large scales produced by small scale inhomogeneities”. However, there are two different 
issues. One is whether certain Newtonian solutions are the limit of some GR solutions; 
the other is whether the GR solution that describes the real Universe is at all times 
close to a corresponding Newtonian solution^^ (see also Ref. [105]). 

Ref. [10] addresses the former question, but it is the latter issue that is relevant 
for providing an accurate GR description of our Universe and evaluating backreaction. 
Even if a particular GR solution starts from initial conditions close to a Newtonian 
solution, this does not imply that the GR solution would remain close to the Newtonian 
solution. As a simple example, in Newtonian gravity an isolated two-body system with 
an elliptic orbit is a stable conhguration, whereas in GR the orbit will decay, and the 
system will be driven far from the Newtonian solution^®. The difference in the evolution 
of the orbit is related to the difference between the Weyl tensor and the corresponding 
quantity in Newtonian theory, the Newtonian tidal tensor. A general GR solution (even 
if the matter is dust) does not correspond to any Newtonian solution. This is related to 
the fact that in Newtonian theory the tidal tensor, corresponding to the electric part of 
the Weyl tensor, does not have an evolution equation, and the tensor corresponding to 

If this were the case, backreaction would be small on the global scale, because in Newtonian theory 
its effect reduces to a boundary term [28], unlike in GR [31]. 

The global stability properties of a FLRW background have been investigated in-depth by a 
dynamical system analysis in the space of physical backgrounds using scaling assumptions (i.e. 
backgrounds that emerge from the average of the inhomogeneous universe model) [106] (see also [107] 
for the class of LTB models). It has been found that the FLRW background is globally gravitationally 
unstable in two sectors, one corresponding to inhomogeneity effects mimicking dark energy behaviour 
on large scales, the other corresponding to inhomogeneity effects mimicking dark matter behaviour on 
small scales. In contrast to these instabilities the Newtonian inhomogeneities average out on the chosen 
background model by construction and this latter is stable [28]. 
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the magnetic part of the Weyl tensor is zero. In GR, both are in general non-zero and 
have evolution equations, see Refs. [78,108-119] for discussion. In [10] the arbitrariness 
of the tidal tensor is hxed by the assumption of periodic boundary conditions. As the 
Newtonian equations are elliptic and do not have a well-dehned initial value problem, the 
boundary conditions are essential [28]. This is quite different from GR, where there is a 
well-dehned initial value problem. Ghanging the evolution of the boundary at distances 
much larger than the GR particle horizon would not impact the GR solutions, but 
can completely change the Newtonian solutions. A lack of backreaction in GR cannot 
be established by starting from the assumption that the Universe is well-described by 
Newtonian theory. 

A common argument against backreaction, sometimes ascribed to [120] and 
repeated by GW is that “by hux conservation, the average of the apparent luminosity 
of sources (including multiple images) must match the FLRW value to a high degree of 
accuracy”. However, if the area element is different from FLRW, then the luminosity 
distance will also be different. Green and Wald assert that it “is a simple fact” [22] that 
if two metrics are perturbatively close, then the average apparent luminosity of sources 
is a close match. Actually, this is incorrect. A counterexample is provided by Enqvist et 
al. [121] in which, while perturbations (and their hrst derivatives) around an Einstein- 
de Sitter background remain small, the luminosity distance can even match that of a 
AGDM model with Ga = 0.7. It is also straightforward to construct exact spherically 
symmetric counterexamples to the flux claim, such as the one in Ref. [122] (see also 
Refs. [123,124]). This issue has been discussed in Section 2.1.2 of Ref. [71] and Section 
4 of Ref. [83]. Even on large scales statistical isotropy and homogeneity is not enough 
to reduce the luminosity distance to its FLRW value, as shown in Ref. [83]. Indeed, 
the violation of the FLRW relationship between the expansion rate and the luminosity 
distance can be used as a test of the importance of inhomogeneities [79,80,83,125-127]. 

6.2. The (quint)essence of the hackreaction approach 

As Green and Wald stress, second derivatives of the metric (i.e. spacetime curvature) 
have to be large in realistic models of the Universe. Averaging or smoothing these 
derivatives leads to an effective stress-energy tensor that is in general not trace-free, as 
we have demonstrated. Hence, this furnishes an argument for the potential importance 
of backreaction. 

The Einstein equations dictate inhomogeneous curvature for inhomogeneous 
sources. A physical cosmology has to capture both inhomogeneities in the sources and 
inhomogeneities in the geometry. Idealising the latter by assuming a homogeneous 
FLRW metric globally, leads to a missing geometrical piece on the left-hand side of 
the Einstein equations that shows up as missing sources on the right-hand side of the 
Einstein equations in the standard model. “Backreaction” takes the inhomogeneities 
in geometry into account and so provides a more realistic average description of the 
left-hand side of the Einstein equations. 
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From first principles, any realistic cosmological model emerging from a 3 + 1 
spacetime split should evolve spatial curvature for evolving sources, even if the spatial 
curvature is set to zero at some initial time. The oversimplified standard model keeps 
curvature at this non-generic value, while hackreaction models evolve curvature, even 
if it is set to zero at some initial time [31]. 

We have to distinguish average properties and fluctuations. Fluctuations may 
remain small on large scales, but that is not the issue: the issue is the background 
about which these fluctuations are small, and this is what is addressed by hackreaction 
models. Metrical deviations from an (unknown) average metric may be small, but it is 
an assumption that this average metric is dynamically equivalent to a homogeneous (and 
flat) FLRW solution. In other words, small metric perturbations do not imply small 
distortions of a flat geometry. In 1916, Einstein already clarified this and explained that 
the perturbations could equally well be small deviations from a large-scale curved space. 
(Einstein, however, reached this conclusion with calculations on which we comment, 
below his quote, from a modern perspective.) This can be considered a variant of the 
steel ball model of GW. In Einstein’s words [128]: 

But it is conceivable that our Universe differs only slightly from a Euclidean 
one, and this notion seems all the more probable, since calculations show that 
the metrics of surrounding space is influenced only to an exceedingly small 
extent by masses even of the magnitude of our sun. We might imagine that, 
as regards geometry, our Universe behaves analogously to a surface which is 
irregularly curved in its individual parts, but which nowhere departs appreciably 
from a plane: something like the rippled surface of a lake. Such a Universe 
might fittingly be called a guasi-Euclidean Universe. As regards its space it 
would be infinite. But calculation shows that in a guasi-Euclidean Universe the 
average density of matter would necessarily be nil. Thus such a Universe could 
not be inhabited by matter everywhere; it would present to us that unsatisfactory 
picture... If we are to have in the Universe an average density of matter which 
differs from zero, however small may be that difference, then the Universe 
cannot be guasi-Euclidean. On the contrary, the results of calculation indicate 
that, if matter be distributed uniformly, the Universe would necessarily be 
spherical (or elliptical). Since in reality the detailed distribution of matter is not 
uniform, the real Universe will deviate in individual parts from the spherical, 
i.e. the Universe will be guasi-spherical. But it will be necessarily finite. 

Einstein emphasized that the small deviations require a more careful description of 
the global spatial curvature. (Einstein mentions a positively curved background, but it 
could as well be negatively curved.) We argue that the deviations should be studied with 
respect to a background that is defined by the actual average distribution (the physical 
background), as it is done in other physical disciplines that investigate fluctuation 
theories, e.g., in solid state physics, not by a priori assumption. 

In a modern perspective, but only in Newtonian cosmology, we can represent the 
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Universe through a hnite flat 3-torus model with Newtonian inhomogeneities on a 
factored out relativistic FLRW model, cf. [28]. In Newtonian cosmology the a priori 
assumption of a background that is equivalent with the average distribution can be 
realised: the spatially averaged model can be assumed to be an FLRW model on 
the imposed scale of the 3-torus due to the globally vanishing backreaction, which 
assures that the inhomogeneities average out to zero, cf. [28]; in general relativity such 
a construction is not possible, and the average model in general does not comply with 
an FLRW model [27]: the physical background will not coincide with an assumed fixed 
background, since it is determined through the dynamics of inhomogeneities. 

Discussions restricting the regime of applicability to regions “far from black holes 
or neutron stars” overlook the fact that on cosmological scales the curvature can be tiny 
but still dominant over the source contributions. Even in homogeneous cosmology, the 
Einstein cosmos [89] provides an example of a curvature-dominated model, its curvature 
radius being smaller than the Schwarzschild radius if the former is greater than about 
3.3 Gpc (assuming a mean density of the present estimate of the total baryonic plus dark 
matter densities and the observed value of the Hubble constant). In other words, this 
example shows that if the curvature is cosmologically significant small metric deviations 
from a flat metric would run into contradictions. 

More generally, even if fields are weak then an important consideration is the 
calibration of the asymptotic rulers and clocks of the close to spatially flat metric 
around bound structures relative to a generally non-FLRW cosmological average. 
Since no systems are truly isolated, this may generally require a notion of “finite 
infinity” [17,129-131] to replace the exactly asymptotically flat geometry to which many 
definitions of gravitational mass are tied. The problems of averaging and coarse-graining 
may therefore be intimately related to fundamental unsolved problems concerning 
gravitational mass-energy [6]. 

Finally, in this paper we focussed on backreaction effects in general relativity, as 
this is also the framework addressed by Green and Wald in their papers. However, there 
could be more degeneracies relating observations with theoretical predictions: we did 
not touch upon issues related to different theoretical frameworks such as the possibility 
of a non-symmetric connection, the role of torsion, non-metrical theories or, generally, 
modified gravitational theories that contain general relativity as a low-energy limit. 
A general assessment of backreaction effects would also incorporate studies of possible 
deviations from classical general relativity. 

7. Conclusion 

There is no proof that backreaction of inhomogeneities is irrelevant for the dynamics 
and observables of our Universe. 

The most detailed study claiming to show that backreaction is irrelevant is in a series 
of papers by Green and Wald [9-12] and a precursor paper by Ishibashi and Wald [13], 
so we have examined those papers in detail (in particular the review in Ref. [12]). We 
have pointed out several issues that lead us to conclude that these do not provide a 
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proof that hackreaction is irrelevant. In particular, we have demonstrated that the 
claimed trace-free nature of hackreaction is unphysical and is not relevant within the 
hackreaction framework. We have also shown that the GW framework does not address 
the actual question of hackreaction since i) it is not concerned with averaging; ii) it does 
not include terms that should he present in a consistent treatment (especially, the trace 
parts); hi) its relation to the cosmological situation is unclear; iv) the example steel hall 
analogy is not concerned with the htting problem; v) the ultra-local limit procedure 
does not take into account that hackreaction is a non-local effect and vi) the criticisms 
misrepresent existing work on hackreaction. 

While the GW framework is not applicable to studying hackreaction, it is possible 
that further developments of their framework may provide useful mathematical results. 

Remark on Green and Wald’s note in response to this paper 

Shortly after this paper was submitted. Green and Wald responded with a note [22] in 
which they redefined the word “hackreaction” to refer to the specihc setting they address 
in their formalism, labelling other approaches—which constitute most of the papers in 
the literature—as “pseudo-backreaction”. They further clarified that their formalism 
was “never intended or claimed to apply” to such approaches'^, and, in particular, that 
their results “do not apply to the Buchert formalism” or to “LTB models” modeling 
large voids [22]. Regardless of terminology, the physical question of interest is whether 
inhomogeneities have a significant effect on the evolution of averaged variables that 
correspond to observable quantities. We have explicitly demonstrated that the definition 
of hackreaction assumed by Green and Wald is too narrow to address this question. 
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In order to address the analogy given by GW, we have to investigate a well-dehned 
intrinsic geometrical mapping between (S^, g^oi) and (S^, Pcan)- Even if we restrict 
ourselves to the two-dimensional case, as GW did—and we emphasized that this case is 
physically irrelevant for the purpose of investigating backreaction—we can exploit the 
Thurston-Rodin-Sullivan e-approximation to the Riemann mapping theorem [132-137] 
(associated with the circle packings techniques developed by W. Thurston). This is not 
a perturbation argument, but rather a deformation argument in the space of (metric) 
geometries over A more direct and easier variant of this would be the PoincarWKoebe 
uniformisation theorem directly formulated for two-dimensional polyhedral metrics (the 
existence of this version of the standard uniformisation technique is a consequence of 
the fact that conical singularities are not seen by the conformal structure of surfaces, 
and this has been nicely developed by many authors [23-25]). Both are mathematically 
well-dehned and constructive procedures. They provide an explicit mapping (typically a 
conformal transformation) between (§^, g^oi) and (§^, gcan)- These mapping techniques 
are currently applied in the modelling of two-dimensional discrete structures in helds 
ranging from image analysis to biology and medicine. In particular, both approaches 
would map (S^, g^oi) to a well-dehned (S^, gcan) without any conical defects whatsoever. 

We would further like to correct the statement by Green and Wald that the metric 
of the polyhedron fails to be smooth also at the edges. The metric of (§^, g-^oi) is 
perfectly smooth at the edges. The bending we envisage at the edges, suggesting a lack of 
smoothness there, is an artifact of the embedding in Euclidean space (E^, 6). It is related 
to the (discretized) second fundamental form (the extrinsic curvature) of (S^, gpoi), and 
if we take it into account, then we need to consider the (discretized version) of the 
Gauss-Godazzi constraint describing the compatibility conditions between the intrinsic 
and extrinsic geometry of the embedding (§^, g^oi) ^ (E^, 5), see, e.g.. Refs. [138,139]. 

Appendix B. The nature of the weak—limit in the GW approach 

This is a technical appendix where we discuss the mathematical and geometrical 
underpinning of Green and Wald’s approach. There are subtle assumptions and 
hypotheses underlying their claimed results that are unclear and not explicitly stated. 
Even when these hypotheses are spelled out clearly they are not always adequately 
implemented, eventually leading to insufficiently justihed conclusions. As a typical and 
important example, let us take the assumptions {ii)-{iv), according to which [9,12] 
the components of the tensor helds 'jab{x,\) and of Vajabix, X) are locally in Li, and 
bounded by smooth functions XCi{x) and C 2 {x), respectively, see Eqs. (7) and (8). 
As often emphasized in [9, 12], no restrictions are placed upon second derivatives of 
■Jab, with the effect that [12] “... second derivatives (i.e., the curvature) of gab may 
have unbounded huctuations relative to 5'at<(0).” Nonetheless computations in [12] are 
performed as if the spacetime metrics considered are smooth, as is conhrmed in the 
recent explanatory note [22]. Unfortunately, given the above hypotheses, one cannot 
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safely carry out computations by declaring that second derivatives of the perturbing 
tensor 7 may be unboundedly large, and then treat them as ordinary smooth functions. 

In fact, {ii)-{iv) imply that the GW effective stress-energy tensor fab(O) cannot 
be dehned in terms of the tensor held pi alone, (cf. (9)): it may have components 
that in general make sense only as non-regular distributions. Furthermore, as we shall 
demonstrate below, under {ii)-{iv) there is a very delicate regularity issue in taking a 
weak-limit, with the consequence that one cannot prove that the trace of fa6(0) vanishes 
without further hypotheses additional to those in (ii)-(zu). In this sense, GW’s Theorem 
1 (the vanishing of the trace of the effective stress-energy tensor), is the counterpart of 
the observations we made for the steel ball analogy: as in the analysis of the steel ball 
model, curvature fluctuations matter; their treatment requires an accurate analysis. 

In order not to have unstated assumptions and unclear formal computations, we 
address the mathematical analysis of GW’s approach stepwise. To avoid the pitfalls 
of doing analysis with potentially unbounded quantities (here the second derivatives of 
the perturbation 7 ), we start by providing a Sobolev space analysis of the regularity 
issues associated with GW’s assumptions {ii)-{iv). We use Sobolev spaces to get to 
the point as quickly as possible and to pinpoint the origin of the problem lurking in the 
background. Those not familiar with Sobolev space technology may skip this part of the 
analysis, if they are willing to accept the fact that taking weak-derivatives and weak- 
limits requires some caution when differentiating or integrating by parts. In successive 
steps we address the technical limitations of GW’s approach, which give rise to the stated 
difficulties in correctly dehning their effective stress-energy tensor tab{0)- Many of these 
difficulties arise because of subtle issues in the adoption of the Burnett formalism, [33] 
on which they heavily rely. 

Step (i): To begin, we introduce, as do Green and Wald, a positive dehnite metric 
e := eabdx^" ® dx^ on the manifold M with respect to which we can take local norms 
of tensors in dehning the relevant functional spaces. In particular, for an open set 
f/ C M we denote by Lf^^(f/), 1 < p < cxo, the space of Borel measurable functions 
/ dehned on U which are p-summable, he., ||/|| lp ( a ') := Jx \f{^)\^dpie{x) < 00 for 
every compact set K <Z U, where d/ie denotes the Riemannian measure associated 
with the metric e. For p = 00, we denote by L ^^{ U ) the Banach space consisting of 
the essentially bounded functions on U, endowed with the essential supremum norm 
11 /I loo := inf {C G M : / < G almost everywhere in U}. We also introduce the class 
of functions in L\^^{U) whose weak hrst derivatives, (see below for dehnitions), are 
also -functions. These are the Sobolev spaces Wlff(U) 1 < p < 00, often used 

in geometric analysis and mathematical general relativity. Their dehnition naturally 
extends to the appropriate spaces of tensor helds. For further details see, e.g.. Refs. 
[37,140-142]. 

It is also appropriate to recall the dehnitions of distributional and weak derivatives. 
Entering into such mathematical detail may appear pedantic. However, the diherent 
properties of these two derivatives play a subtle, but fundamental, role in what 
follows. As usual, let C^{M) be the space of smooth compactly supported functions 
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on M. If we denote by Tj the regular distribution associated with / G LlciUl i.e., 
Tf^cj)) := f[x) dpie{.x), for 0 G C^{M), then Tf always admits a distributional 
derivative V Tf, dehned according to VT/(0) := — T/(V 0), viz.. 

VTficf) -.= - [ f{x)V(f{x)dfie{x), V0 G C^{M) . (B.l) 

J M 

If / is continuously differentiable, we can integrate by parts on the right-hand side, 

Vr/(0) f{x)V(f){x)diie{x)= f V/(a;)0(x)d/ie(x) = Tv/(0) , (B.2) 

J M J M 

which provides the usual, often abused, rationale in manipulating distributional 
derivatives. Higher-order distributional derivatives are dehned in a similar way. In 
nonlinear problems, like the one we are discussing here, it is more useful and sometimes 
indeed necessary to consider, rather than the distributional derivative, the more 
restrictive concept of weak derivative. Let / G L\^^{U) be a locally integrable function 
on the open set U ^ M and let 7/ be the corresponding regular distribution. If there 
exists a locally integrable function F G such that V Tf := Tp, viz. 


/(x) V0(x) d/ie(x) = 


F[x) (j){x) dp,e{x) , V0 G 


Cf(M) , 


(B.3) 


' M 


’ M 


then we say that F is the weak derivative of /, and write F = V f. Weak derivatives 
emphasize local summability, whereas distributional derivatives put the accent on 
differentiability. The two notions agree for smooth functions, and often the notion 
of weak derivative is tacitly traded for the dehnition of derivative in the sense of 
distributions. But the two have quite different properties. In particular, the weak 
derivative may not exist: two well-known examples are the Heaviside step function and 
the Cantor function. Neither admits a weak derivative whereas they both do have a 
distributional derivative: the Dirac measure supported at the origin, and the Lebesgue- 
Stieltjes measure supported on the Cantor set, respectively. Both of these distributions 
are non-regular distributions, i.e., they cannot be associated with a locally summable 
function. Note also that the existence of a weak derivative is not equivalent to the 
existence of a pointwise derivative almost everywhere. 

Given these technical issues one may ask why we should not simply interpret GW’s 
formalism in the distributional sense, as suggested by their use of the weak topology 
(i.e., weak-limits and integration by parts), and conhrmed by their recent explanatory 
note [22]. To illustrate why not, we will present a simple example to show that there is a 
price to pay if one goes distributional without due care, in particular when interpreting 
weak-limits as a form of averaging. The example in question is related to the vanishing 
of the weak-limit w-limA^-o VaVf, 7 ce(A), a basic result which has many implications in 
GW’s approach. 

For simplicity, let us work on M, and consider the function (playing the role of GW’s 
metric perturbation 'yab{x, A)) 

.2 2 


'y{x,v) := 




y/dwrj 


e iv = T]^p^{x,r]-, 0) , 


(B.4) 
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where p{x,ri; 0) is the heat kernel in M with source at x = 0, and where rj G (0,1] (i.e., 
we are taking A := r/^). One has 


d 

dx 


-f{x,r]) 


XT] _ 

- ;- e 477 

2 ^/Anri 


dx^ 


l{x,ri) 



1 ^ 

2 / yJAnr] ^ 



(B.5) 


Clearly, for all r; > 0, we have smooth functions, i.e., 7 (x, r/), ^ 7 (x, 7 ), and ■^'y{x,rj) 
are Moreover, 0 < 7 (x, 7 ) < rf {An, and 0 < |^ 7 (a;, 7 )| < 

7] {An 7 )“ hence 7 (x, 7 ) tends to zero as 7 \ 0, uniformly in x, while ^ 7 (x, 7 ) —)■ 0 

pointwise. Finally, since Prj{x,ri; 0) is not locally summable, ^ 7 (x, 7 ) tends 

to zero in the sense of distributions. Explicitly, let T(a; 2 / 4 _^/ 2 ) and Tp(^x,ri-,o) be the 
regular distributions associated, for any hxed 7 G ( 0 , 1 ], with the smooth functions 
(t ~ f) 0)- Since, for any 0 G C“(M), (^ — ^ and 

lim^x^o / x^ 0 (x)p(x, 7 ; 0 ) dx = (x^ 0 (x))|a;=o = 0 (by the known properties of the 
heat kernel p{x,rj; 0 )), we can write 


x 2 

lim Ttx^ 14—1^12) Tp(x,ri-,o) . *^o 0 , (B. 6 ) 

ri\0 4 

where 5o is the Dirac measure supported at x = 0. This shows that using distributions 

is a far cry from what we would naturally consider, i.e., the vanishing of (d^/dx ^)7 
resulting from a suitable average of a locally summable function over the region of 
interestThe above example clearly shows that, rather than an average, a direct 
distributional approach may give rise to measure concentration phenomena (associated 
here with the heat kernel measure e dx concentrating, as 7 \ 0 , the zero 

weak-limit of (d^/dx ^)7 at the origin x = 0 ). 

One may argue that, in contrast to the pathological example (B.4), the oscillating 
to death function 'y{x,ri) = rj sin(x/? 7 ), 0 < 7 < 1 , often quoted in [ 12 ], is a simple 
example modeling the assumptions {ii)-{iv), with a nicer behaviour. In this case we 
have = cos{x/r]), and ^ = —ksm{x/r]). Since sin(x/ 7 )dx is uniformly 

bounded, one immediately gets lim,j\^o Td'2'y{x,rj)/dx^ = 0 as a regular distribution. 
Moreover, one easily computes 


/dx"^ 2 ’ (^•'^) 

which, while indeed providing an elementary realization of GW’s assumption {iv), 
also shows that modeling 7 ab(x, A) and its derivatives (hence gab{x,X)) as regular 
distributions—complying with {ii)-{iv) —is quite problematic: there can be a lack 
of continuity in the relevant geometric operators^®. In particular, the mapping that 
associates to the metric gab{x,X) its Riemannian curvatures, say the Riemann tensor 
gab{x, A) I —> 7l'^abc{x, A), cannot be a continuous map between the appropriate space of 


In Green and Wald’s papers it is often suggested that the weak-limit “corresponds roughly to taking 
a local spacetime average”. In particular, the concluding paragraph of [12] strongly emphasizes this 
aspect. 

The relation (B.7) is a standard example proving that multiplication is not a continuous operation 
in distribution space. When computing curvatures for gab{x,X) we need to use the product 

^ 7 ( 37 , 77 ) ^d^'y(x,ri)/dx^ • 
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tensor fields, if the weak-limit and derivatives are interpreted distribntionally withont 
dne care. For these reasons it is more exact to use weak rather than distributional 
derivatives. Even in this case distributions may nonetheless arise, as we shall see in 
the computation of the Einstein tensor for low regularity metric perturbations such as 
those occurring in GW’s case. However, our calculation will be done in a controlled way 
from the point of view of geometrical analysis, allowing us to pinpoint the origin of the 
potential problems in the formalism. 

Green and Wald’s assumptions {ii)-{iv) imply that each metric tensor g{X) along 
the curve of metrics (5), is a low regularity (perturbation of a) Lorentzian metric. 
The minimum assumptions needed to dehne the Riemann tensor (and all attendant 
curvatures) along (5), and at the some time comply with GW’s assumptions {ii)-{iv), 
are to assume that on the smooth background provided by (M, 5 '( 0 )) the components 
of the tensor held 7 ab(A) are essentially bounded and, together with their derivative, 
are locally square summable. Indeed, these are the minimal technical requirements 
needed to safely carry out most of the formal computations in Green and Wald’s papers. 
Explicitly, we require that around the generic point in (M, (/(O)) there exists a local 
coordinate neighbourhood {U, a;“), U C M, where we can write 

gab{X) = gab{0) + 7ab(A) , A G ( 0 , 1 ] , (B.8) 

and where the components of the perturbation tensor held 7 are such that 

e <"(f/)nL£(f/), (B.9) 

for all A G (0,1]. Here, according to the notation introduced above, is the 

(local) Sobolev space of sections which together with their hrst derivative are square 
summable (in U), and denotes sections which are essentially summable in 

U, (the relevant Sobolev norms on (M, 5 '( 0 )) being dehned in {U, x°‘) [37,140,141]). 
Note that GW also impose the smallness of 7ab(A,a;) as A \ 0, by assuming that 
|7ab(A)| < AGi(a;) for some smooth positive function Ci{x) on M. In our setting this 
is just a smallness constraint on the local components of 7 and does not imply the 
smallness of the Wlf^(U)-noiTsi of 7, in line with the GW formalism. 

It is convenient to take (M, (^(O)) to be a generic background metric (similar to the 
case of the almost-everywhere hat metric in the steel ball analogy). This is in line with 
the remarks in GW that their computations hold for generic background metrics. By 
so doing we can better appreciate the diherences arising from assuming FLRW as the 
background. 

The Sobolev space assumption above provides a local control on 7 and its hrst 
derivatives that is consistent with the simpler GW hypotheses {ii)-{iv), but at the 
same time allows us a more precise analytical control on the geometrical tensor helds we 
need to use. To wit, since the set Wyf^{U) fl L^^{U) is an algebra under pointwise 
multiplication [141], we can dehne all algebraic manipulations^® of g{X) and 7(A). 

We can define all algebraic manipulations in the algebra associated with the product of the equivalence 
classes of functions defined almost everywhere. 
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Moreover, V 7 (A) and the Christoffel symbols ra 6 (A) of ^(A) are in Ll^{U). More 
precisely, from the standard formula relating the Christoffel symbols of the two metrics 
(M, and (M, g{X)) we have 

C(A) = rt(g(0)) - ivtg«(A) (gM(A)g,,(A)g“‘(A) - gM(A)iy“ - 9,,(A)i7“), (B.IO) 

and we can symbolically write, in terms of the functional class, 

r“,(A) e C^{U) + lUu) , (B.ii) 

where C°°{U) here and in the following formulas refers to the smooth part of the relevant 
quantity under discussion, associated with the background (M, (assumed to be 

smooth). 

Since V 7 (A) G it follows that V 7 (A) V 7 (A) is in i.e., it is locally 

summable for all A G ( 0 , 1 ] and hence there exists the weak-limit tensor jOabcdf dehned 
by (9) as a regular distribution locally represented by a (tensor-valued) continuous 
function. In general, this is not a smooth tensor field, as assumed in (iv). However, 
the smoothness requirement is not really relevant to the GW argument since the terms 
causing trouble are the second derivatives of 7 . Indeed, from the (formal) local dehnition 
of the Riemann tensor components in {U,x°‘), 

:= daTtiX) - dbTtiX) + r-(A)rL(A) - C(A)rL(A),(B.12) 

it follows that the terms daTf^ — in R'^abcW^ containing the second derivatives of 
7, can be given sense only as elements of the distributional space (roughly 

speaking, the topological dual of fRiof (H)), whereas the quadratic terms containing the 
first derivatives of 7, i.e. are, as introduced above, in L\„(U). Clearly, 

since (M, (^(O)) is assumed smooth, in the second-order linear part of the Riemann 
tensor, there is a smooth part (cf. B.ll), and we can write 

Kbc{>^) e W-l’\U) + LL(R) + C°°(f/) . (B.13) 

Since contraction is an algebraic operation, the same situation holds also for the Ricci 
tensor, and we have 

Rac{X) := Ri,,{X) G W-l^\U) + Li^iU) + C'°°(R) . (B.14) 

The situation for the scalar curvature is subtler: in order to contract the Ricci tensor we 
need to trace the distribution-valued tensor components Rac{X) with the components 
g^^{X) of the inverse metric, (which are in WlflifJ) fl L'^^(U)). For the components of 
the Einstein tensor we have 

Gab(A) G W-^^'\U)+ C°^{U) , for A G (0,1] . (B.15) 

Step (ii)\ Clearly, what makes the components Gab{X) of the Einstein tensor potentially 
described by singular distributions is the presence of the VV 7 (A) terms. Since we are 
dealing with a family of metrics A —> g{X), X G (0,1], it may well happen that 


For a similar treatment of low regularity metrics in Riemannian geometry, see [143]. 
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the distributional part of Gab{^) vanishes in the limit A \ 0 as a consequence of 
the GW hypotheses {ii)-{iv). Even if this is the case, in order to give substance to 
GW’s formalism, we must carefully check that this vanishing weak-limit is supported 
on a set of signihcant measure and not associated with a delta-like measure or with 
a discontinuous dependence on the averaging scale (see the example (B.4)-(B.7)). In 
fact. Green and Wald claim that with the weak-limit operation many of these second 
derivative terms over the smooth background (M, (/(O)) vanish in the limit A \ 0. 
In particular, they assume in [9] that the terms (7ifc(0) VaV 67 e/(A) weakly vanish in 
the A \ 0 limit. Explicitly, by referring to the terms containing two derivatives of 
habW, they state^^ just before formula (13) in [9] “These terms can be classihed into 
the following types: (a) terms linear in hab(A), corresponding to the linearized Einstein 
operator acting on hafe(A); (b) terms quadratic in hab{\), corresponding to the second- 
order Einstein operator acting on hab(A); and (c) terms cubic and higher-order in hab(A). 
The weak-limit of terms of type (a) vanish by the type of argument leading to (3)....”. 
Actually, in claiming that the terms of type (a) vanish by the type of argument leading 
to (3), they implicitly extend Burnett’s argument [33] (as exploited in Eq. (3) of [9] 
for Erst derivatives of 'jab) to second derivatives, an extension that under their stated 
hypotheses is not guaranteed. 

Explicitly, and regardless of the potential distributional nature of the Einstein 
tensor discussed above, the problem in GW’s approach lies in analysing the weak-limit 
of their equation (12) in Ref. [9]. This is the core of their work, but unfortunately, the 
analysis there is performed with some hidden assumptions. Given the importance of this 
point, let us describe it in detail. What GW exploit in proving the vanishing of the weak- 
limit of terms of type (a) is the direct application of case (b) of “Burnett’s Theorem” 
by identifying Burnett’s tensor held a(A) with V 7 afe(A), (cf. Ref. [33], pp.95-96, and in 
particular the top of p.92 where Burnett claims that “V G^^(A) —> 0 weakly”). Green 
and Wald implicitly assume that, for any compactly supported test tensor held density 
yafece, write 

W-lim VaVb7ce(A) = lim [ r^^^VaVblce = -lim f VaF^^Vb^ce = 0 , (B.16) 

where the hrst step is integration by parts of the term VaVb'lceW, and where in the 
second step one exploits the weak convergence to 0 of Vb 7 ce(A). 

This would be hne, if we interpreted (B.16) in the distributional sense and exploit 
the continuity of the distributional derivative. However, as shown above, this may 
generate a vanishing weak-limit associated with concentration phenomena or exhibiting 
a discontinuous dependence on the averaging scale. To avoid this we need to require 
that in the limit A \ 0, the tensor components VaV;, 7 ce(A) are represented by functions 
which are at least locally summable. But this strict weak derivative representation is 
also problematic, since it is not a priori allowed under the general assumptions on 7 ce(A) 
adopted by GW, in particular if Va'^bJceW can be unboundedly large. This indicates 

^^Note: /lab(A) of [9] is 7af)(A) here and in [12]. 
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that local summability may also be a very delicate issue in GW’s formalism and cannot 
be taken for granted. In particular, in order to define the quantities VaVbIceW in the 
weak sense and give meaning to (B.16) without nonregular distributions coming in, we 
have to assume, according to the very dehnition (B.3) of the weak derivative, that for 
every A G (0,1] there is an array of functions FabceW = , Fn{X)), each of 

which is in such that 

J = - J Vb7ce(A)V,r'“, (B.17) 

for all compactly supported test tensor densities These Fabce{X) are the weak 

derivatives of Vfe 7 ce(A), namely we can symbolically write FabceW ■— “VaVb7ce(A)”. 
In the GW framework there are further constraints in addition to local summability 
that FabceW niust then comply with. Since 

w-^im Vb 7 ce(A) = 0 , (B.18) 

the weak characterisation (B.17) of VaVfe 7 ce(A) and the GW assumptions (ii), (Hi), 
(iv) necessarily require that the locally summable functions FabceW must have a 
vanishing weak-limit as A \ 0. Moreover, we have to artihcially require that this 
weak convergence to zero of VaVft 7 ce(A) is not uniform in A, i.e., we must require that 
VaVfe 7 ce(A) are not uniformly bounded as A varies. This latter constraint is necessary 
in order to have that 

wMim 7cd(A) VaVb7e/(A) = -p-abcdef , (B.19) 

A ^0 

(cf. Eq. (13) in Ref. [9]). It is easy to see that, if the weak convergence of VaVbXceiX) 
to zero were uniform, then the weak-limit (B.19) would be 0. 

This non-uniformity, (more or less tacitly assumed in GW’s (ii), (Hi), (iv)), is 
again an argument that GW [9] take from Burnett (cf. the displayed equation on top 
of page 92 of Ref. [33]). In Burnett’s words “Does {gdeW — 5'(ie(0)) Vm G^^(A) —> 0 
weakly as X — > 0? Not in general! Although gdeW — gde{0) — > 0 uniformly and 
VmC“f,{X) —y 0 weakly, VmG^^(A) need not be uniformly bounded. In fact, if gabW 
also satisfies condition (iv), then {gdeW “ 5 'de( 0 )) Vm G^j(A) converges weakly to some 
expression in fimnabcd- ■■■”■ 

Whereas this lack of uniformity makes sense in the high-frequency limit in 
gravitational wave theory—simply because non-uniformity is almost intrinsic to wave 
propagation in the high-frequency regime (typically modelled after the behaviour of 
the function A sin (x/A))—it seems unlikely to be justihed in cosmological backreaction 
where there are hnite averaging scales dehned by the gravitational system considered: 
what does “high-frequency” mean in this setting? Is this ultra-local limit sensible in 
cosmology? 

In a realistic cosmological averaging procedure, we may allow very high density 
contrasts 5p/ p S> 1, (hence very large VaVb 7 ce(A)), but we have to control the averaging 
scale over which this oscillating contrast is relevant and hence assume a uniform 
boundedness hypothesis on VaVb 7 ce(A). Otherwise, if we want to have potentially 
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unboundedly large VaVb'Jcei^) while at the same time avoiding the distribution-valued 
curvature that, as we have seen, naturally develops under such hypotheses, we do need, 
as in GW’s formalism, a rather artihcial hue-tuning between the non-uniformity of the 
upper bound of |VaVfe 7 ce(A)| and the relevant weak-limits as A \ 0. This is not a minor 
technical point, since it is this non-uniformity in bounding the size of VaVb 7 ce(A) which 
is responsible for the sudden activation (at A = 0) of the trace-less backreaction in GW. 
If we make VaVblceW uniformly bounded as A \ 0, then there is no Habcdef to play 
with. 

Summing up, we have to require (as a necessary and sufficient condition for (B.16) to 
hold and produce GW’s results) that, (extending the GW enumeration of assumptions 
(z), (iz), (izi), (iv))\ 

(v) For every given A G (0,1], there is a sequence {Vfe 7 ce^(A)}„gN of functions 
Vfe 7 ce^(A) e C°°{U) such that, as n —)■ oo, Vb7ce\i\) — Vb7ce(^) in and 

(VZ) V,Vb7i:K^) ^FabceW, 

(vzz) V.Vfe 7 l^)(A)|A\o^O, (B.20) 

again in (These requirements simply follow from the standard characterization 

of weak derivatives as limits of sequences of derivatives of smooth functions.) Moreover, 
(via) The sequence VrfVfc 7 ce^(A) G C°°{U) fl L\^JJJ) must not be uniformly bounded 
(as a function of A) as A \ 0. If such a sequence exists, then we can identify the (weak) 
derivative VaV^yce^A) of Vfe 7 ce(A) with Fabce{^) and this weak derivative will have the 
property w-lim VaV^yce^A) = 0, required by the GW formalism. 

These remarks show that w-lim VaVb 7 ce(A) = 0 is not a consequence of the 
GW hypotheses (ii), (in), (iv) but a strong a priori assumption that resembles GW’s 
interpretation of the steel ball example (where they a priori kill the angular defects in 
the limit). To reach the conclusion that VaVb7ce{^) weakly vanishes as A \ 0, and 
produces non-trivial results we must add to the GW assumptions {ii)-{iv) the further 
hypotheses {v)~{viii) described above, some of which lack physical justihcation. 

All in all, we cannot conclude, with any reasonable level of geometrical and physical 
rigour, that the weak-limit of the (linear) terms 5 f( 0 )VV 7 (A), present in Gab(A), 
vanishes as A \ 0. A minimum requirement for this to happen is a much stronger and 
unphysical control {ii)-{viii) on the curve of metrics A i —> g{X) than that associated 
with GW’s assumptions {ii)-{iv). In particular, we wish to emphasize that the (implicit) 
non-uniformity of GW’s weak “averaging” is highly formal in a cosmological setting. 
GW’s formalism depends on this assumption in an essential way. 

We conclude: replacing the non-uniform boundedness of VV 7 (A), as A \ 0, with 
a more natural boundedness assumption, GW’s formalism becomes empty, insofar as, 
by removing the non-uniform boundedness (in A) requirement on VV 7 (A) as A \ 0, 
the a priori assumptions (i)-(iv) made by Green and Wald can only be self-consistent 
if the tensor held pZabcdefif)) vanishes identically. 


No proof that hackreaction is irrelevant 


37 


Appendix C. Comments on examples for hackreaction that aim at 
including matter inhomogeneities 

Example by Szybka et al. 

Let us consider Szybka et al.’s [47] example in relation to GW’s claim that hackreaction 
is trace-free. The example is based on the Wainwright-Marshman metric: 

dg 2 _ ^ 2 m^n ^ ^ ^ 1/2 ^^^2 ^ ^ ^ 2 ^ ^^2 ^ 2w dxdy] , (C.l) 

where t > 0, —oo < x,y,z, < +oo, m is a free parameter, n and w are functions of a 
single variable u := t — z, and the held equation 

dn/du = {dw/duY (C.2) 

helps to simplify the Einstein tensor. The stress-energy tensor is that of a perfect huid 
with equation of state 

^ = p = + , (C.3) 

Svr 

so the weak energy condition holds for m > —3/16. Szybka et al. [47] set w := X sin(M/A) 
which gives n = | (u + iAsin(2M/A)). In the limit: limA\oW = 0 and hmA\o^^ = 
hence (C.2) no longer holds for the background. 

The Ricci scalar of the metric gab{X) is R{X) = —4 (16m + 3) t“ 2 (m+i)g-n(A)^ where 
for the background metric 5'afe(0), n should be substituted by limA\o’^- This can be 
used as follows to show that, although the derivatives of metric deviations and quadratic 
products thereof in this example are singular in the limit as A goes to zero, the Ricci 
scalar and the stress-energy tensor are not and thus cannot produce hackreaction from 
inhomogeneities. First, let us subtract the Einstein equations for A > 0, 

RabigW) - ^gab{X)R{g{X)) = SnT,,{X) , (C.4) 

from the background dynamical equation; for A = 0 (using the notation tafe(O), cf. 
Sect. 3.7), we have: 

RabigiO)) - ^gabiO)Rigid)) = Svr (Tab(O) + Gb(0)) . (C.5) 

We take the w-lim of the difference (C.5)-(C.4) (recall that tafe(O) remains unaffected 
by the weak-limit operator): 

tabid) = w-lim iRabigid)) - RabigiX))) , (C.6) 

A ^0 

because w-lim (^(^((O)) - ^(^((A))) = 0 , w-lim ( 5 („b( 0 )R( 5 (( 0 )) - 5 (ab(A)R( 5 ((A))) = 0, 
and w-lim (Tab(O) - TatiX)) = 0. 

Introducing the trace-free Ricci tensor Sab = Rab — jgabR, we get: 

Gfe(O) = w-lim (S'ab(5'(0)) - SabigiX))) , (C.7) 

A ^0 

since again w-hmA\o igabid)Rigid)) — gabiX)RigiX))) = 0. From this we see that 
the hackreaction term tab entirely emerges from the trace-less A-dependent curvature 
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{Sab{g{^)))- The Ricci scalar and the stress-energy tensor, where the density 
inhomogeneities are encoded, cancel ont. Since the Wainwright-Marshman spacetimes 
are interpreted as cosmological models with gravitational waves, we can attribnte all 
the hackreaction present in this example to gravitational radiation, not to density 
inhomogeneities. 

Example by Green and Wald 

Now we consider the example provided by GW in [11] (section 4). We recall GW’s 
assnmption (i) [9], he., that there exists a family of metrics PabW and a smooth fnnction 
Cl snch that: 

• QabiX) represents the real Universe, while gabX) is the backgronnd, averaged metric; 

• Gab{gX)) = 87rTafe(A) for all A > 0 and Tab{X) obeys the weak energy condition; 
and 

• hab{X) = |5'a6(0) — gabiXl < • 

What does it imply for gabX) to b® ^ solntion of the Einstein eqnations? For A = 1 this 
is trne by dehnition, since we assnme that the real Universe is well-described by general 
relativity. However, while we have a straightforward prescription of how to constrnct 
GabigiX) for 1 > A > 0, the sitnation with Tab{X) is not so clear. 

We can distingnish two reasonable approaches: 

(i) we calcnlate GabigiX) define Tab{X) ■= -^GabX) (this is the option chosen by 
Green and Wald); or 

(ii) we calcnlate Tab for a given A from its dehnition in terms of the non- 

gravitational part of the Lagrangian density of the Einstein-Hilbert action for 
matter, and its fnnctional derivative, i.e.. 

Tab := 2 . (C.8) 

ogab 

To see that these two possibilities are, in general, different, let ns consider Green and 
Wald’s example for both of them. By assnmption, there is an FLRW backgronnd metric 
5'afe(0) and a conformally related family of metrics: 

5 'ab(A) = G^(A) 5 fa 6 ( 0 ) , wliere lnr2(A) = AH ^sin 

The Einstein tensor bnilt ont of gabX)i he., Gab{,gX))i is related to the FLRW Einstein 
tensor Gafe( 5 '( 0 )) by a pnrely geometrical formula: 

Gab{g{\)) = Gab{gm - (2VaVGnG - 2f7,b(0)/‘'(0)VeVHnG 

-2(Va In G)(Vft In G) - gabX)g"\X (Vc In G)(In G)) . (G. 10) 

Green and Wald show (see (4.4) in [11]) that the effective stress-energy tensor in the 
weak-limit reads: 

tab{0) = -^GabigiO)) - Tab{0) = ^ wHim (Gab(g(0)) - Gab(g(X)) , (CTl) 


- + sm Y + sm - ) . (G.9) 

A A A / 



No proof that backreaction is irrelevant 


39 


and that the backreaction fonnd by the GW prescription has a trace corresponding to 
a P = —|p flnid. 

Now we have to choose in which way we dehne the stress-energy tensor for 
0 < A < 1. 

On the one hand from (i) : 

87rT,b(A) = G,fe(f7(A)) , (C.12) 


into which (C.IO) has to be snbstitnted, while on the other hand from (ii): 


T,fe(A) = O-Xb(O) . 


(C.13) 


The reason for the transformation rnle (C.13) is conformal invariance of the matter 
action: 

^(A) = j £„(A)v/=^d"a; = j £^(0) V'^^d^x = ^(0) , (C.14) 

where the Lagrangian densities transform according to 

P^(A) = , (C.15) 


so that (C.8) gives 


yab(A) 




n 


-4 


<^gabW 
dgcd(O) 


\/—g(0) dpabi^) Sgcd{0) 


^-g{0)C^{0) 


(C.16) 


lowering of indices nsing gaeW and 5 'f,/(A) gives (C.13) (see, e.g., [144]). In (C.14) and 
(C.16), a non-snbscripted g indicates the determinant of the metric. 

To show that case (i) is in general incompatible with case (ii) we proceed as follows. 
Let us assume that for A = 1, the metric and stress-energy tensor represent the ‘real’ 
Universe or, as in the case presented, the toy universe model that we wish to average. 
Dehne / := f2(A)D“^(A = 1). 

We have for case (i): 


G'afe(5'(l)) — 87rTaft(l) ; gab{^) — D^(A — l)5fa6(0) ; gabW — Al^(A)5'afe(0) , 


which implies that 

gab{X) = n\x)n-\x = l)gab{l) . (C.17) 

Thus, gab{X < 1) and 5'afe(l) are conformally related via gab{X) = pgab{l), since / is a 
ratio of exponentials of smooth real-valued functions (see (C.9)), and thus smooth and 
real-valued. 

For case (ii): 

TabiX) = f-^Tabil) (C.18) 

Gab{g{X)) = Gab{g{l)) - (2VaVCn/ - 2gab{l)g^\l)VcVd\n f- 

2iVa In f)iVb In /) - ^,,(1)/"(1)(’V, In /)(’V, In /)) , (C.19) 
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where the covariant derivative V is compatible with the metric Qabi^) and the geometrical 
formula (C.19) is obtained directly from (C.IO), and thus is also valid in case (ii). 
Combining (C.18) and (C.19) and imposing the Einstein equations for A > 0 we obtain: 

8vrT,b(l) = (2V„Vfeln/ - 2^7,6(l)/"(l)VeV,ln/- 

2(VJn/)(V6ln/) - ( 7 ,b(l)/''(l)(Veln/)(V,ln/)) , (C.20) 

which is a contradiction since the left-hand side is hxed and the right-hand side depends 
on A (lengthy but easy calculations show that the dependence on A does not cancel). 

We conclude that, within the GW formalism (as their explicit example shows), it is 
not in general possible to build a family of metric-dependent tensors using the method 
(ii) that obeys the Einstein equations. This gives us a strong indication of the nature 
of the Tab (A) family obtained with method (i). 

To see this explicitly let us look at GW’s choice (i): using the purely geometrical 
relation (C.19) and the Einstein equations for A > 0, we have^^: 

87rTab(A) = 87rTab(l) - ^2VaVbln/ - 25(afe(l)/'’*(l)VcVdln/- 

2(Valn/)(Vbln/) - ( 7 ab(l)/"(l)(Valn/)(Vdln/)) . (C.21) 

Now we take the weak-limit of both sides. 

The weak-limit of Tab{g{^)) is (where is a test tensor held): 

w-hmTab(^(l)) = j d^x^y-g{0)Tab{g{l))r^ , (C.22) 

which we can naturally associate with an averaged stress-energy tensor usually denoted 
(Tab) and denoted Tab(O) by GW (according to GW: “we may interpret Tab(O) as 
representing the matter stress-energy tensor averaged over small scale inhomogeneities”, 
[9] p. 13). Since the weak-limit of the remaining terms on the right-hand side does 
not disappear, we see that w-hmTab(A) 7 ^ Tab(O). As a consequence, in a more realistic 
case we should not expect Tab(O) to match the FLRW stress-energy tensor obtained by 
averaging the density inhomogeneities. It seems that the Tab(A) dehned via the Einstein 
held equations already contains a backreaction ehect and thus a part of backreaction is 
absorbed into the dehnition of Tab(O). In other words, forcing the existence of stress- 
energy tensors that obey the Einstein equations makes the weak-limit of this family of 
tensors distinct from the averaged stress-energy tensor. 

In summary, we are not aware of an example of a metric family satisfying the GW 
conditions that satisfactorily describes backreaction from matter inhomogeneities. 


In the arXiv preprint version 1 of this paper we erroneously referred to Eq. (C.18), as has been 
kindly pointed out in [22]. 
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